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Prologue 


When people today hear the term “information revolution,” they typically think 
of silicon chips and software code, cellular phones and supercomputers. They 
rarely think of tiny one-celled organisms or the rise of animal life. But, while 
writing these words in the summer of 2012, I am sitting at the end of a narrow 
medieval street in Cambridge, England, where more than half a century ago a 
far-reaching information revolution began in biology. This revolution was 
launched by an unlikely but now immortalized pair of scientists, Francis Crick 
and James Watson. Since my time as a Ph.D. student at Cambridge during the 
late 1980s, I have been fascinated by the way their discovery transformed our 
understanding of the nature of life. Indeed, since the 1950s, when Watson and 
Crick first illuminated the chemical structure and information-bearing properties 
of DNA, biologists have come to understand that living things, as much as high- 
tech devices, depend upon digital information—information that, in the case of 
life, is stored in a four-character chemical code embedded within the twisting 
figure of a double helix. 

Because of the importance of information to living things, it has now become 
apparent that many distinct “information revolutions” have occurred in the 
history of life—not revolutions of human discovery or invention, but revolutions 
involving dramatic increases in the information present within the living world 
itself. Scientists now know that building a living organism requires information, 
and building a fundamentally new form of life from a simpler form of life 
requires an immense amount of new information. Thus, wherever the fossil 
record testifies to the origin of a completely new form of animal life—a pulse of 
biological innovation—it also testifies to a significant increase in the information 
content of the biosphere. 

In 2009, I wrote a book called Signature in the Cell about the first 
“information revolution” in the history of life—the one that occurred with the 
origin of the first life on earth. My book described how discoveries in molecular 
biology during the 1950s and 1960s established that DNA contains information 
in digital form, with its four chemical subunits (called nucleotide bases) 


functioning like letters in a written language or symbols in a computer code. And 
molecular biology also revealed that cells employ a complex information- 
processing system to access and express the information stored in DNA as they 
use that information to build the proteins and protein machines that they need to 
stay alive. Scientists attempting to explain the origin of life must explain how 
both information-rich molecules and the cell’s information-processing system 
arose. 

The type of information present in living cells—that is, “specified” 
information in which the sequence of characters matters to the function of the 
sequence as a whole—has generated an acute mystery. No undirected physical or 
chemical process has demonstrated the capacity to produce specified information 
starting “from purely physical or chemical” precursors. For this reason, chemical 
evolutionary theories have failed to solve the mystery of the origin of first life— 
a claim that few mainstream evolutionary theorists now dispute. 

In Signature in the Cell, I not only reported the well-known impasse in 
origin-of-life studies; I also made an affirmative case for the theory of intelligent 
design. Although we don’t know of a material cause that generates functioning 
digital code from physical or chemical precursors, we do know—based upon our 
uniform and repeated experience—of one type of cause that has demonstrated 
the power to produce this type of information. That cause is intelligence or mind. 
As information theorist Henry Quastler observed, “The creation of information 
is habitually associated with conscious activity.”! Whenever we find functional 
information—whether embedded in a radio signal, carved in a stone monument, 
etched on a magnetic disc, or produced by an origin-of-life scientist attempting 
to engineer a self-replicating molecule—and we trace that information back to 
its ultimate source, invariably we come to a mind, not merely a material process. 
For this reason, the discovery of digital information in even the simplest living 
cells indicates the prior activity of a designing intelligence at work in the origin 
of the first life. 

My book proved controversial, but in an unexpected way. Though I clearly 
stated that I was writing about the origin of the first life and about theories of 
chemical evolution that attempt to explain it from simpler preexisting chemicals, 
many critics responded as if I had written another book altogether. Indeed, few 
attempted to refute my book’s actual thesis that intelligent design provides the 
best explanation for the origin of the information necessary to produce the first 
life. Instead, most criticized the book as if it had presented a critique of the 
standard neo-Darwinian theories of biological evolution—theories that attempt 


to account for the origin of new forms of life from simpler preexisting forms of 
life. Thus, to refute my claim that no chemical evolutionary processes had 
demonstrated the power to explain the ultimate origin of information in the DNA 
(or RNA) necessary to produce life from simpler preexisting chemicals in the 
first place, many critics cited processes at work in already living organisms—in 
particular, the process of natural selection acting on random mutations in already 
existing sections of information-rich DNA. In other words, these critics cited an 
undirected process that acts on preexistent information-rich DNA to refute my 
argument about the failure of undirected material processes to produce 
information in DNA in the first place.* 

For example, the eminent evolutionary biologist Francisco Ayala attempted to 
refute Signature by arguing that evidence from the DNA of humans and lower 
primates showed that the genomes of these organisms had arisen as the result of 
an unguided, rather than intelligently designed, process—even though my book 
did not address the question of human evolution or attempt to explain the origin 
of the human genome, and even though the process to which Ayala alluded 
clearly presupposed the existence of another information-rich genome in some 
hypothetical lower primate.° 

Other discussions of the book cited the mammalian immune system as an 
example of the power of natural selection and mutation to generate new 
biological information, even though the mammalian immune system can only 
perform the marvels it does because its mammalian hosts are already alive, and 
even though the mammalian immune system depends upon an elaborately 
preprogrammed form of adaptive capacity rich in genetic information—one that 
arose long after the origin of the first life. Another critic steadfastly maintained 
that “Meyer’s main argument” concerns “the inability of random mutation and 
selection to add information to [preexisting] DNA”“ and attempted to refute the 
book’s presumed critique of the neo-Darwinian mechanism of biological 
evolution accordingly. 

I found this all a bit surreal, as if I had wandered into a lost chapter from a 
Kafka novel. Signature in the Cell simply did not critique the theory of 
biological evolution, nor did it ask whether mutation and selection can add new 
information to preexisting information-rich DNA. To imply otherwise, as many 
of my critics did, was simply to erect a straw man. 

To those unfamiliar with the particular problems faced by scientists trying to 
explain the origin of life, it might not seem obvious why invoking natural 
selection does not help to explain the origin of the first life. After all, if natural 


selection and random mutations can generate new information in living 
organisms, why can it also not do so in a prebiotic environment? But the 
distinction between a biological and prebiotic context was crucially important to 
my argument. Natural selection assumes the existence of living organisms with a 
capacity to reproduce. Yet self-replication in all extant cells depends upon 
information-rich proteins and nucleic acids (DNA and RNA), and the origin of 
such information-rich molecules is precisely what origin-of-life research needs 
to explain. That’s why Theodosius Dobzhansky, one of the founders of the 
modern neo-Darwinian synthesis, can state flatly, “Pre-biological natural 
selection is a contradiction in terms.”° Or, as Nobel Prize-winning molecular 
biologist and origin-of-life researcher Christian de Duve explains, theories of 
prebiotic natural selection fail because they “need information which implies 
they have to presuppose what is to be explained in the first place.”° Clearly, it is 
not sufficient to invoke a process that commences only once life has begun, or 
once biological information has arisen, to explain the origin of life or the origin 
of the information necessary to produce it. 

All this notwithstanding, I have long been aware of strong reasons for 
doubting that mutation and selection can add enough new information of the 
right kind to account for large-scale, or “macroevolutionary,” innovations—the 
various information revolutions that have occurred after the origin of life. For 
this reason, I have found it increasingly tedious to have to concede, if only for 
the sake of argument, the substance of claims I think likely to be false. 

And so the repeated prodding of my critics has paid off. Even though I did 
not write the book or make the argument that many of my critics critiqued in 
responding to Signature in the Cell, I have decided to write that book. And this is 
that book. 

Of course, it might have seemed a safer course to leave well enough alone. 
Many evolutionary biologists now grudgingly acknowledge that no chemical 
evolutionary theory has offered an adequate explanation of the origin of life or 
the ultimate origin of the information necessary to produce it. Why press a point 
you never made in the first place? 

Because despite the widespread impression to the contrary—conveyed by 
textbooks, the popular media, and spokespersons for official science—the 
orthodox neo-Darwinian theory of biological evolution has reached an impasse 
nearly as acute as the one faced by chemical evolutionary theory. Leading 
figures in several subdisciplines of biology—cell biology, developmental 
biology, molecular biology, paleontology, and even evolutionary biology—now 


openly criticize key tenets of the modern version of Darwinian theory in the 
peer-reviewed technical literature. Since 1980, when Harvard paleontologist 
Stephen Jay Gould declared that neo-Darwinism “is effectively dead, despite its 
persistence as textbook orthodoxy,”” the weight of critical opinion in biology has 
grown steadily with each passing year. 

A steady stream of technical articles and books have cast new doubt on the 
creative power of the mutation and selection mechanism.® So well established 
are these doubts that prominent evolutionary theorists must now periodically 
assure the public, as biologist Douglas Futuyma has done, that “just because we 
don’t know how evolution occurred, does not justify doubt about whether it 
occurred.”? Some leading evolutionary biologists, particularly those associated 
with a group of scientists known as the “Altenberg 16,” are openly calling for a 
new theory of evolution because they doubt the creative power of the mutation 
and natural selection mechanism.!° 

The fundamental problem confronting neo-Darwinism, as with chemical 
evolutionary theory, is the problem of the origin of new biological information. 
Though neo-Darwinists often dismiss the problem of the origin of life as an 
isolated anomaly, leading theoreticians acknowledge that neo-Darwinism has 
also failed to explain the source of novel variation without which natural 
selection can do nothing—a problem equivalent to the problem of the origin of 
biological information. Indeed, the problem of the origin of information lies at 
the root of a host of other acknowledged problems in contemporary Darwinian 
theory—from the origin of new body plans to the origin of complex structures 
and systems such as wings, feathers, eyes, echolocation, blood clotting, 
molecular machines, the amniotic egg, skin, nervous systems, and 
multicellularity, to name just a few. 

At the same time, classical examples illustrating the prowess of natural 
selection and random mutations do not involve the creation of novel genetic 
information. Many biology texts tell, for example, about the famous finches in 
the Galapagos Islands, whose beaks have varied in shape and length over time. 
They also recall how moth populations in England darkened and then lightened 
in response to varying levels of industrial pollution. Such episodes are often 
presented as conclusive evidence for the power of evolution. And indeed they 
are, depending on how one defines “evolution.” That term has many meanings, 
and few biology textbooks distinguish between them. “Evolution” can refer to 
anything from trivial cyclical change within the limits of a preexisting gene pool 
to the creation of entirely novel genetic information and structure as the result of 


natural selection acting on random mutations. As a host of distinguished 
biologists have explained in recent technical papers, small-scale, or 
“microevolutionary,” change cannot be extrapolated to explain large-scale, or 
“macroevolutionary,” innovation.!! For the most part, microevolutionary 
changes (such as variation in color or shape) merely utilize or express existing 
genetic information, while the macroevolutionary change necessary to assemble 
new organs or whole body plans requires the creation of entirely new 
information. As an increasing number of evolutionary biologists have noted, 
natural selection explains “only the survival of the fittest, not the arrival of the 
fittest.”!* The technical literature in biology is now replete with world-class 
biologists’? routinely expressing doubts about various aspects of neo-Darwinian 
theory, and especially about its central tenet, namely, the alleged creative power 
of the natural selection and mutation mechanism. 

Nevertheless, popular defenses of the theory continue apace, rarely if ever 
acknowledging the growing body of critical scientific opinion about the standing 
of the theory. Rarely has there been such a great disparity between the popular 
perception of a theory and its actual standing in the relevant peer-reviewed 
scientific literature. Today modern neo-Darwinism seems to enjoy almost 
universal acclaim among science journalists and bloggers, biology textbook 
writers, and other popular spokespersons for science as the great unifying theory 
of all biology. High-school and college textbooks present its tenets without 
qualification and do not acknowledge the existence of any significant scientific 
criticism of it. At the same time, official scientific organizations—such as the 
National Academy of Sciences (NAS), the American Association for the 
Advancement of Sciences (AAAS), and the National Association of Biology 
Teachers (NABT)—routinely assure the public that the contemporary version of 
Darwinian theory enjoys unequivocal support among qualified scientists and that 
the evidence of biology overwhelmingly supports the theory. For example, in 
2006 the AAAS declared, “There is no significant controversy within the 
scientific community about the validity of the theory of evolution.”!“ The media 
dutifully echo these pronouncements. As New York Times science writer 
Cornelia Dean asserted in 2007, “There is no credible scientific challenge to the 
theory of evolution as an explanation for the complexity and diversity of life on 
earth.” 

The extent of the disparity between popular representations of the status of 
the theory and its actual status, as indicated in the peer-reviewed technical 
journals, came home to me with particular poignancy as I was preparing to 


testify before the Texas State Board of Education in 2009. At the time the board 
was considering the adoption of a provision in its science education standards 
that would encourage teachers to inform students of both the strengths and 
weaknesses of scientific theories. This provision had become a political hot 
potato after several groups asserted that “teaching strengths and weaknesses” 
were code words for biblical creationism or for removing the teaching of the 
theory of evolution from the curriculum. Nevertheless, after defenders of the 
provision insisted that it neither sanctioned teaching creationism nor censored 
evolutionary theory, opponents of the provision shifted their ground. They 
attacked the provision by insisting that there was no need to consider weaknesses 
in modern evolutionary theory because, as Eugenie Scott, spokeswoman for the 
National Center for Science Education, insisted in The Dallas Morning News, 
“There are no weaknesses in the theory of evolution.”!® 

At the same time, I was preparing a binder of one hundred peer-reviewed 
scientific articles in which biologists described significant problems with the 
theory—a binder later presented to the board during my testimony. So I knew— 
unequivocally—that Dr. Scott was misrepresenting the status of scientific 
opinion about the theory in the relevant scientific literature. I also knew that her 
attempts to prevent students from hearing about significant problems with 
evolutionary theory would have likely made Charles Darwin himself 
uncomfortable. In On the Origin of Species, Darwin openly acknowledged 
important weaknesses in his theory and professed his own doubts about key 
aspects of it. Yet today’s public defenders of a Darwin-only science curriculum 
apparently do not want these, or any other scientific doubts about contemporary 
Darwinian theory, reported to students. 

This book addresses Darwin’s most significant doubt and what has become of 
it. It examines an event during a remote period of geological history in which 
numerous animal forms appear to have arisen suddenly and without evolutionary 
precursors in the fossil record, a mysterious event commonly referred to as the 
“Cambrian explosion.” As he acknowledged in the Origin, Darwin viewed this 
event as a troubling anomaly—one that he hoped future fossil discoveries would 
eventually eliminate. 

The book is divided into three main parts. Part One, “The Mystery of the 
Missing Fossils,” describes the problem that first generated Darwin’s doubt—the 
missing ancestors of the Cambrian animals in the earlier Precambrian fossil 
record—and then tells the story of the successive, but unsuccessful, attempts that 
biologists and paleontologists have made to resolve that mystery. 


Part Two, “How to Build an Animal,” explains why the discovery of the 
importance of information to living systems has made the mystery of the 
Cambrian explosion more acute. Biologists now know that the Cambrian 
explosion not only represents an explosion of new animal form and structure but 
also an explosion of information—that it was, indeed, one of the most significant 
“information revolutions” in the history of life. Part Two examines the problem 
of explaining how the unguided mechanism of natural selection and random 
mutations could have produced the biological information necessary to build the 
Cambrian animal forms. This group of chapters explains why so many leading 
biologists now doubt the creative power of the neo-Darwinian mechanism and it 
presents four rigorous critiques of the mechanism based on recent biological 
research. 

Part Three, “After Darwin, What?” evaluates more current evolutionary 
theories to see if any of them explain the origin of form and information more 
satisfactorily than standard neo-Darwinism does. Part Three also presents and 
assesses the theory of intelligent design as a possible solution to the Cambrian 
mystery. A concluding chapter discusses the implications of the debate about 
design in biology for the larger philosophical questions that animate human 
existence. As the story of the book unfolds, it will become apparent that a 
seemingly isolated anomaly that Darwin acknowledged almost in passing has 
grown to become illustrative of a fundamental problem for all of evolutionary 
biology: the problem of the origin of biological form and information. 

To see where that problem came from and why it has generated a crisis in 
evolutionary biology, we need to begin at the beginning: with Darwin’s own 
doubt, with the fossil evidence that elicited it, and with a clash between a pair of 
celebrated Victorian naturalists—the famed Harvard paleontologist Louis 
Agassiz and Charles Darwin himself. 


Part One 


The Mystery of the Missing Fossils 
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Darwin’s Nemesis 


When Charles Darwin finished his famous book, he thought that he had 
explained every clue but one. 

By anyone’s measure, On the Origin of Species was a singular achievement. 
Like a great Gothic cathedral, the ambitious work integrated many disparate 
elements into a grand synthesis, explaining phenomena in fields as diverse as 
comparative anatomy, paleontology, embryology, and biogeography. At the same 
time, it was impressive for its simplicity. Darwin’s Origin explained many 
classes of biological evidence with just two central organizing ideas. The twin 
pillars of his theory were the ideas of universal common ancestry and natural 
selection. 

The first of these pillars, universal common ancestry, represented Darwin’s 
theory of the history of life. It asserted that all forms of life have ultimately 
descended from a single common ancestor somewhere in the distant past. In a 
famous passage at the end of the Origin, Darwin argued that “all the organic 
beings which have ever lived on this earth have descended from some one 
primordial form.”! Darwin thought that this primordial form gradually 
developed into new forms of life, which in turn gradually developed into other 
forms of life, eventually producing, after many millions of generations, all the 
complex life we see in the present. 

Biology textbooks today usually depict this idea just as Darwin did, with a 
great branching tree. The trunk of Darwin’s tree of life represents the first 
primordial organism. The limbs and branches of the tree represent the many new 
forms of life that developed from it (see Fig. 1.1). The vertical axis on which the 
tree is plotted represents the arrow of time. The horizontal axis represents 
changes in biological form, or what biologists call “morphological distance.” 
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FIGURE 1.1 


Darwin’s evolutionary tree of life, as depicted by the nineteenth-century German 
evolutionary biologist Ernst Haeckel. 


Biologists often call Darwin’s theory of the history of life “universal common 
descent” to indicate that every organism on earth arose from a single common 
ancestor by a process of “descent with modification.” Darwin argued that this 
idea best explained a variety of biological evidences: the succession of fossil 
forms, the geographical distribution of various species (such as Galapagos 
finches), and the anatomical and embryological similarities among otherwise 
highly distinct organisms. 

The second pillar of Darwin’s theory affirmed the creative power of a process 
he called natural selection, a process that acted on random variations in the traits 
or features of organisms and their offspring. Whereas the theory of universal 
common descent postulated a pattern (the branching tree) to represent the 


history of life, Darwin’s idea of natural selection referred to a process that he 
said could generate the change implied by his branching tree of life. 

Darwin formulated the idea of natural selection by analogy to a well-known 
process, that of “artificial selection” or “selective breeding.” Anyone in the 
nineteenth century familiar with the breeding of domestic animals—dogs, 
horses, sheep, or pigeons, for example—knew that human breeders could alter 
the features of domestic stock by allowing only animals with certain traits to 
breed. A sheepherder from the north of Scotland might breed for a woollier 
sheep to enhance its chances of survival in a cold northern climate (or to harvest 
more wool). To do so, he would choose only the woolliest males and woolliest 
ewes to breed. If generation after generation he continued to select and breed 
only the woolliest sheep among the resulting offspring, he would eventually 
produce a woollier breed of sheep. In such cases, “the key is man’s power of 
accumulative selection,” wrote Darwin. “Nature gives successive variations; 
man adds them up in certain directions useful to him.”? 

Darwin noted that pigeons have been coaxed into a dizzying variety of 
breeds: the carrier, with its elongated eyelids and a “wide gape of mouth”; the 
“short-faced tumbler,” with its “beak in outline almost like that of a finch”; the 
common tumbler, with its penchant for flying in close formation and “tumbling 
in the air head over heels”; and, perhaps strangest of all, the pouter, with its 
elongated legs, wings, and body overshadowed by its “enormously developed 
crop, which it glories in inflating” for its astonished patrons.* 

Of course, pigeon breeders achieved these startling metamorphoses by 
carefully sifting and selecting. But, as Darwin pointed out, nature also has a 
means of sifting: defective creatures are less likely to survive and reproduce, 
while those offspring with beneficial variations are more likely to survive, 
reproduce, and pass on their advantages to future generations. In the Origin, 
Darwin argued that this process, natural selection acting on random variations, 
could alter the features of organisms just as intelligent selection by human 
breeders can. Nature itself could play the role of the breeder. 

Consider once more our flock of sheep. Imagine that instead of a human 
selecting the woolliest males and ewes to breed, a series of very cold winters 
ensures that all but the very woolliest sheep in a population die. Now again only 
very woolly sheep will remain to breed. If the cold winters continue over several 
generations, will the result not be the same as before? Won’t the population of 
sheep eventually become discernibly woollier? 

This was Darwin’s great insight. Nature—in the form of environmental 


changes or other factors—could have the same effect on a population of 
organisms as the intentional decisions of an intelligent agent. Nature would favor 
the preservation of certain features over others—specifically, those that 
conferred a functional or survival advantage upon the organisms possessing 
them—causing the features of the population to change. And the resulting 
change will have been produced not by an intelligent breeder choosing a 
desirable trait or variation—not by “artificial selection”—but by a wholly natural 
process. What’s more, Darwin concluded that this process of natural selection 
acting on randomly arising variations had been “the chief agent of change” in 
generating the great branching tree of life in all its variety. 

On the Origin of Species seized the attention of the scientific community like 
a thunderclap. Darwin’s analogy to artificial selection was powerful, his 
proposed mechanism of natural selection and random variation easily grasped, 
and his skill in dispensing with potential objections unrivalled. Moreover, the 
explanatory scope of his argument for universal common descent constituted 
something of a tour de force. By the close of the Origin, it seemed to many that 
Darwin had dispensed with every conceivable objection to his theory but one. 


THE ANOMALY: DARWIN’S DOUBT 


Despite the scope of his synthesis, there was one set of facts that troubled 
Darwin—something he conceded his theory couldn’t adequately explain, at least 
at present. Darwin was puzzled by a pattern in the fossil record that seemed to 
document the geologically sudden appearance of animal life in a remote period 
of geologic history, a period that at first was commonly called the Silurian, but 
later came to be known as the Cambrian. 

During this geological period, many new and anatomically sophisticated 
creatures appeared suddenly in the sedimentary layers of the geologic column 
without any evidence of simpler ancestral forms in the earlier layers below, in an 
event that paleontologists today call the Cambrian explosion. Darwin frankly 
described his concerns about this conundrum in the Origin: “The difficulty of 
understanding the absence of vast piles of fossiliferous strata, which on my 
theory were no doubt somewhere accumulated before the Silurian [i.e., 
Cambrian] epoch, is very great,” he wrote. “I allude to the manner in which 
numbers of species of the same group suddenly appear in the lowest known 
fossiliferous rocks.”° The sudden appearance of animals so early in the fossil 
record did not easily accord with Darwin’s new theory of gradual evolutionary 


change, and there was one scientist who would not let him forget it. 


THE ANTAGONIST 


Swiss-born paleontologist Louis Agassiz, of Harvard University, was one of the 
best-trained scientists of his age, and he knew the fossil record better than any 
man alive. Hoping to enlist Agassiz as an ally, Darwin sent him a copy of On the 
Origin of Species and asked him to consider the argument with an open mind 
(see Fig. 1.2). One can almost see the great, aging naturalist receiving the 
unremarkable package from the postman, unwrapping the small green volume 
that had stirred such a tempest on both sides of the Atlantic. Perhaps he retired to 
his study the better to concentrate, scrutinizing the book’s prepossessing title, 
recalling what he had already heard about the work. He read the book with deep 
interest, making notes in the margin as he moved through it, but in the end his 
verdict would disappoint its author. Agassiz concluded that the fossil record, 
particularly the record of the explosion of Cambrian animal life, posed an 
insuperable difficulty for Darwin’s theory. 








FIGURE 1.2 
Figure 1.2a (left): Louis Agassiz. Figure 1.2b (right): Charles Darwin. 


THE TWO-PRONGED CHALLENGE 


To see why, consider brachiopods and trilobites, two of the best-documented 
creatures in the Cambrian fossil record by 1859. The brachiopod (see Fig. 1.3), 
with its two shells, looks like a clam or an oyster, but is very different inside. As 
shown in the accompanying figure, it possesses a gonad, mantle, mantle cavity, 
anterior body wall, body cavity, gut, and lophophore, the last of which is a 


feeding organ like a ring of tentacles, usually in the shape of a coil or horseshoe, 
with a mouth inside the ring of tentacles, and an anus outside. The brachiopod 
exhibits a highly complex overall body plan, with many individually complex 
and functionally integrated anatomical systems and parts. Its tentacles, for 
instance, are covered by cilia precisely arranged to generate and direct a current 
of water toward the mouth.® 
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FIGURE 1.3 

Figure 1.3a (top): Brachiopod internal anatomy. Figure 1.3b (bottom, left): 
Brachiopod fossil showing remains of internal structure. Courtesy Paul Chien. 
Figure 1.3c (bottom, right): Fossil showing exterior structure of brachiopod 
shell. Courtesy Corbis. 





MIDDLE OR AXIAL LOBE 








GILL 
CUTAWAY 
vIEw LEGS 
| COMPOUND 
HAL Sa 
EPHALON —_ 
‘ (HEAD) » AXTAL 
FURROW 
PLEURA 
ToP 
VIEW THORAX —— LOBE 
PLEURAL 


FURROW 


PLEURAL 


PYGIDIUM — SPINE 


(TAIL) Ze 


Lijit ty 
PLEURAL CENTRAL PLEURAL 
LOBE AXIS LoBE 








FIGURE 1.4 


Figure 1.4a (top): Trilobite anatomy. Figure 1.4b (bottom): Trilobite fossil of the 
species Kuanyangia pustulosa. Courtesy Illustra Media. 


Even more sophisticated was the trilobite (see Fig. 1.4), with its three 
longitudinal lobes across its head (a raised middle lobe and a flatter pleural lobe 
to either side) and a body divided into three parts—head, chest, and tail, the 
former two consisting of as many as thirty segments. It had a pair of legs for 
every pleural groove and another three pairs for the head. Most dramatic of all 
were the compound eyes found on even some of the very early trilobites—eyes 
that afforded these not so primitive animals a 360-degree field of vision.’ 

The abrupt appearance of such complex anatomical designs presented a 
challenge to each of the two main parts of Darwin’s theory of evolution. 


THE CAMBRIAN EXPLOSION AND THE ACTION OF NATURAL 
SELECTION 


The Cambrian fossil evidence represented a significant challenge to Darwin’s 
claim that natural selection had the capacity to produce novel forms of life. As 
Darwin described it, the ability of natural selection to produce significant 
biological change depends upon the presence of three distinct elements: (1) 
randomly arising variations, (2) the heritability of those variations, and (3) a 
competition for survival, resulting in differences in reproductive success among 
competing organisms. 

According to Darwin, variations in traits arise randomly. Some variations 
(such as thicker fleece) might confer advantages in the competition for survival 
in particular environmental conditions. Those variations that are heritable and 
that impart functional or survival advantage will be preserved in the next 
generation. As nature “selects” these successful variations, the features of a 
population change. 

Darwin conceded that the beneficial variations responsible for permanent 
change in species are both rare and necessarily modest. Major variations in 
forms, what later evolutionary biologists would term “macromutations,” 
inevitably produce deformity and death. Only minor variations meet the test of 
viability and heritability. 

It followed that, over human timescales, the benefits of this evolutionary 
mechanism would be difficult or impossible to spot. But given enough time, 
favorable variations would gradually accumulate and give rise to new species 
and, given more time, even fundamentally new groups of organisms and body 
designs. If artificial selection could conjure so many strange breeds from a wild 
strain in a few centuries, Darwin argued, imagine what natural selection could 
achieve over many millions of years. Even the origin of complex structures such 
as the mammalian eye—which seemed at first to present a significant challenge 
to his theory—could be explained if one postulated the existence of an initially 
simpler structure (such as a light sensitive spot) that could be gradually modified 
over long periods of time. 

And that was the rub. Darwin’s mechanism of natural selection and random 
variation necessarily required a lot of time to generate wholly novel organisms, 
creating a dilemma that Agassiz was keen to expose. 

In an 1874 Atlantic Monthly essay titled “Evolution and the Permanence of 
Type,” Agassiz explained his reasons for doubting the creative power of natural 


selection. Small-scale variations, he argued, had never produced a “specific 
difference” (i.e., a difference in species). Meanwhile, large-scale variations, 
whether achieved gradually or suddenly, inevitably resulted in sterility or death. 
As he put it, “It is a matter of fact that extreme variations finally degenerate or 
become sterile; like monstrosities they die out.”® 

Darwin himself insisted that the process of evolutionary change he 
envisioned must occur very gradually for the same reason. Thus, Darwin 
realized that building, for instance, a trilobite from single-celled organisms by 
natural selection operating on small, step-by-step variations would require 
countless transitional forms and failed biological experiments over vast stretches 
of geologic time. As University of Washington paleontologist Peter Ward would 
later explain, Darwin had very specific expectations for what paleontologists 
would find below the lowest known strata of animal fossils—in particular, 
“intervening strata showing fossils of increasing complexity until finally 
trilobites appeared.”’ As Darwin noted, “If my theory be true, it is indisputable 
that before the lowest Silurian [Cambrian] stratum was deposited, long periods 
elapsed, as long as, or probably far longer than, the whole interval from the 
Silurian age to the present day; and that during these vast, yet quite unknown, 
periods of time, the world swarmed with living creatures.” !° 

The mechanism of natural selection necessarily had to work gradually on 
small incremental variations. And, indeed, the kinds of variations that Darwin 
actually observed and described in developing his analogy between natural and 
artificial selection were in every case minor. Only by selecting and accumulating 
minor variations over many generations were breeders able to produce the 
striking changes in the features of a breed, changes that were, nevertheless, 
extraordinarily modest compared to the radical differences in form between, say, 
Precambrian and Cambrian forms of life. At the end of the day, as Agassiz 
hastened to note, the pigeons Darwin cited in support of the creative power of 
artificial and, by analogy, natural selection were still pigeons. More significant 
changes to the form and anatomical structure of organisms would, by the logic of 
Darwin’s mechanism, require untold millions of years, precisely what seemed 
unavailable in the case of the Cambrian explosion. 


THE CAMBRIAN EXPLOSION AND THE TREE OF LIFE 


The abrupt appearance of the Cambrian fauna also posed a separate but related 
difficulty for Darwin’s picture of a continuously branching tree of life. To 


produce truly novel animal forms, the Darwinian mechanism would—by its own 
internal logic—require not only millions of years, but untold generations of 
ancestors. Thus, even the discovery of a handful of plausible intermediates 
allegedly linking a Precambrian ancestor to a Cambrian descendant wouldn’t 
come close to fully documenting Darwin’s picture of the history of life. If 
Darwin is right, Agassiz argued, then we should find not just one or a few 
missing links, but innumerable links shading almost imperceptibly from alleged 
ancestors to presumed descendants. Geologists, however, had found no such 
myriad of transitional forms leading to the Cambrian fauna. Instead, the 
stratigraphic column seemed to document the abrupt appearance of the earliest 
animals. 

Agassiz thought the evidence of abrupt appearance, and the absence of 
ancestral forms in the Precambrian, refuted Darwin’s theory.!! Of these earlier 
forms, Agassiz asked, “Where are their fossilized remains?” He insisted that 
Darwin’s picture of the history of life “contradict[ed] what the animal forms 
buried in the rocky strata of our earth tell us of their own introduction and 
succession upon the surface of the globe. Let us therefore hear them;—for, after 
all, their testimony is that of the eye-witness and the actor in the scene.” 


MURCHISON, SEDGWICK, AND THE CAMBRIAN FOSSILS OF 
WALES 


Darwin, for his part, responded with more than civility. Far from dismissing 
Agassiz, he conceded that his objection carried considerable force. Nor was 
Agassiz alone in pressing these concerns. Other leading naturalists thought the 
fossil evidence presented a significant obstacle to Darwin’s theory. At the time, 
perhaps the best place to investigate the lowest known strata of fossils was 
Wales, and one of its leading experts was Roderick Impey Murchison, who 
named the earliest geologic period the Silurian after an ancient Welsh tribe. Five 
years before On the Origin of Species, he called attention to the sudden 
appearance of complex designs like the compound eyes of the first trilobites, 
creatures already thriving at the apparent dawn of animal life. For him, this 
discovery ruled out the idea that these creatures had evolved gradually from 
some primitive and relatively simple form: “The earliest signs of living things, 
announcing as they do a high complexity of organization, entirely exclude the 
hypothesis of a transmutation from lower to higher grades of being.” !° 

The other pioneering explorer of Wales’s rich fossil record, Adam Sedgwick, 


also thought that Darwin had leaped beyond the evidence, as he told him in a 
letter in the fall of 1859: “You have deserted—after a start in that tram-road of 
all solid physical truth—the true method of induction.”!4 Sedgwick might have 
had in mind the same evidence the two men had studied together some twenty- 
eight years before when the Cambridge professor had brought Darwin along as 
his field assistant to explore, in the Upper Swansea Valley in northwestern 
Wales, the very strata that seemed to testify so powerfully to the sudden 
appearance of animal life. It was these strata that Sedgwick named after a 
Latinized English term for the country of Wales—“Cambria,” a designation that 
eventually replaced “Silurian” as the name for the earliest strata of animal 
fossils. 
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FIGURE 1.5 
Three organisms that first appear in the Ordivician period: eurypterans (sea 
scorpions), starfish, and tetracoral. 





Sedgwick emphasized that these Cambrian animal fossils appeared to pop out 
of nowhere into the geological column. But he also stressed what he viewed as a 
broader reason to doubt Darwin’s evolutionary model: the sudden appearance of 
the Cambrian animals was merely the most outstanding instance of a pattern of 
discontinuity that extends throughout the geologic column. Where in the 
Ordovician strata, for instance, are many of the families of the trilobites and 
brachiopods present in the Cambrian just below it?! These creatures along with 


numerous other types suddenly disappear. But just as suddenly one finds 
newcomers in the Ordovician strata like the eurypterans (sea scorpions), starfish, 
and tetracorals (see Fig. 1.5).!° In a later Paleozoic period called the Devonian, 
the first amphibians (e.g., Ichthyostega) arise. Much later, many staples of the 
Paleozoic era (which encompasses the Cambrian, Ordovician, and four 
subsequent periods) suddenly go extinct in a period called the Permian.!” Then, 
in the Triassic period that follows, completely novel animals such as turtles and 
dinosaurs emerge.!® Such discontinuity, Sedgwick argued, is not the exception, 
but the rule. 


DATING BY DISCONTINUITY 


Already by Sedgwick’s time, the various strata of fossils had proved so distinct 
one from another that geologists had come to use the sharp discontinuities 
between them as a key means for dating rocks. Originally, the best tool for 
determining the relative age of various strata was based on the notion of 
superposition. Put simply, unless there is a reason to believe otherwise, a 
geologist provisionally assumes that lower rocks were put down before the rocks 
above them. Now, contrary to a widespread caricature, no respected geologist, 
then or now, adopts this method uncritically. The most basic training in geology 
teaches that rock formations can be twisted, upended, even mixed pell-mell by a 
variety of phenomena. This is why geologists have always looked for other 
means to estimate the relative age of different strata. 

In 1815, Englishman William Smith had hit upon just such an alternative 
means.!? While studying the distinct fossil strata exposed during canal 
construction, Smith noted that so dissimilar are the fossil types among different 
major periods and so sharp and sudden the break between them, that geologists 
could use this as one method for determining the relative age of strata. Even 
when layers of geological strata are twisted and turned, the clear discontinuities 
between the various strata often allow geologists to discern the order in which 
they were deposited, particularly when there is a broad enough sampling of rich 
geological sites from the period under investigation to study and cross-reference. 
Although not without its pitfalls, this approach has become a standard dating 
technique, used in conjunction with superposition and other more recent 
radiometric dating methods.*° 
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Indeed, it’s difficult to overemphasize how central the approach is to modern 
historical geology. As Harvard paleontologist Stephen Jay Gould explains, it is 
the phenomenon of fossil succession that dictates the names of the major periods 
in the geological column (see Fig. 1.6). “We might take the history of modern 
multi-cellular life, about 600 million years, and divide this time into even and 
arbitrary units easily remembered as 1—12 or A-L, at 50 million years per unit,” 
Gould writes. “But the earth scorns our simplifications, and becomes much more 
interesting in its derision. The history of life is not a continuum of development, 
but a record punctuated by brief, sometimes geologically instantaneous, episodes 
of mass extinction and subsequent diversification.”*! The question that Darwin’s 
early critics posed was this: How could he reconcile his theory of gradual 
evolution with a fossil record so discontinuous that it had given rise to the names 


of the major distinct periods of geological time, particularly when the first 
animal forms seemed to spring into existence during the Cambrian as if from 
nowhere? 


A SOLUTION UNSEEN 


Of course, Darwin was well aware of these problems. As he noted in the Origin, 
“The abrupt manner in which whole groups of species suddenly appear in certain 
formations has been urged by several paleontologists—for instance, by Agassiz, 
Pictet, and Sedgwick—as a fatal objection to the belief in the transmutation of 
species. If numerous species, belonging to the same genera or families, have 
really started into life all at once, the fact would be fatal to the theory of descent 
with slow modification through natural selection.”** Darwin, however, proposed 
a possible solution. He suggested that the fossil record may be significantly 
incomplete: either the ancestral forms of the Cambrian animals were not 
fossilized or they hadn’t been found yet. “I look at the natural geological record, 
as a history of the world imperfectly kept, and written in a changing dialect,” 
Darwin wrote. “Of this history we possess the last volume alone, relating only to 
two or three countries. Of this volume, only here and there a short chapter has 


been preserved; and of each page, only here and there a few lines. . . . On this 
view, the difficulties above discussed are greatly diminished, or even 
disappear.”*° 


Darwin himself was less than satisfied with this explanation.** Agassiz, for 
his part, would have none of it. “Both with Darwin and his followers, a great part 
of the argument is purely negative,” he wrote. They “thus throw off the 
responsibility of proof. ... However broken the geological record may be, there 
is a complete sequence in many parts of it, from which the character of the 
succession may be ascertained.” On what basis did he make this claim? “Since 
the most exquisitely delicate structures, as well as embryonic phases of growth 
of the most perishable nature, have been preserved from very early deposits, we 
have no right to infer the disappearance of types because their absence disproves 
some favorite [i.e., Darwinian] theory.”*° 

Though Darwin himself was less than enthusiastic about his response to 
Agassiz’s objection, it seemed adequate to satisfy the needs of the moment. The 
overwhelming preponderance of evidence that Darwin had marshaled seemed to 
support his theory. In any case, many leading naturalists—Joseph Hooker, 
Thomas Huxley, Ernst Haeckel, and Asa Gray—all younger than Agassiz, 
quickly aligned themselves with his evolutionary line of thinking. True, some 
scientists, notably the Scottish engineering professor Fleeming Jenkin and (later) 
the English geneticist William Bateson, expressed persistent doubts about the 


efficacy of natural selection. But despite the views of some weighty scientific 
critics, Darwin’s revolutionary theory won increasingly wide support and soon 
defined the terms of the debate about the history of life. Those who rejected it 
wholesale, as Agassiz did, consigned themselves to increasing irrelevance. 


AGASSIZ UNDER THE MICROSCOPE 


So did Agassiz identify a genuine problem for Darwin’s theory, a mystery, at 
least, waiting to be solved? If so, whatever became of this problem? And if not, 
how could such a brilliant and knowledgeable scientist, someone so steeped in 
the evidence, fall so far outside the mainstream of scientific opinion? 

Historians of science in the post-Darwinian era have typically attempted to 
answer this later question by portraying Agassiz as a brilliant and respected 
scientist who nevertheless was too ossified to catch the new wave, a figure past 
his prime and mired in philosophical prejudice.*° Biographer Edward Lurie 
describes the Harvard naturalist as a “giant of the nineteenth century . . . a person 
deeply involved in his surroundings, a man who understood the possibilities of 
life with an uncommon awareness.” Similarly, historian Mabel Robinson says 
that she long awaited a biography of Agassiz that “would re-create this man of 
genius and his headlong splendid race through life.” He was, she said, “a man to 
remember because genius is rare,” “an immortal Pied Piper.”*® These scholars 
are merely echoing what Agassiz’s contemporaries, even Darwin himself, said. 
“What a set of men you have at Harvard!” Darwin told the American poet Henry 
Wadsworth Longfellow. “Both our universities put together cannot furnish the 
like. Why, there is Agassiz—he counts for three.”*? 

Even so, many historians argue that Agassiz was too infected by German 
idealism to properly assess the factual basis of Darwin’s case. According to 
idealist philosophers of biology, living forms exemplified transcendent ideas and 
in their organization provided evidence of purposive design in nature. Comments 
historian A. Hunter Dupree, “Agassiz’s idealism was of course the basis of his 
concepts of species and their distribution,” of his insistence that a divine or 
intellectual cause must stand behind the origin of each type.*° The ship of 
science was transitioning from idealism to modern empiricism. Agassiz had 
fallen overboard, since he had imbibed too deeply an outmoded idealism from 
his teacher, the French anatomist Georges Cuvier, and from philosophers like 
Friedrich Schelling, who “ran wild in trying to put all nature into a unified and 
absolute system of ideas.”°! Agassiz wasn’t merely wrong, Dupree explains, but 
an annoying obscurantist, actively fighting “against the extension of empiricism 
into natural history.”°? 

Edward Lurie offers a similar if somewhat more nuanced assessment: 
although “quite capable of making the most admirable scientific discoveries 


reflecting complete devotion to scientific method,’ Agassiz “would then 
interpret the data through the medium of what seemed to be the most absurd 
metaphysics.”°? The very man who made “the most careful, exact, and precise 
descriptions” of the natural world would, in his generalizations from those 
observations, “indulge in flights of idealistic fancy.”°* In short, Lurie thought 
that “Agassiz’s cosmic philosophy shaped his entire reaction to the evolution 
idea.”°° 

As science advanced in the late nineteenth century, it increasingly excluded 
appeals to divine action or divine ideas as a way of explaining phenomena in the 
natural world. This practice came to be codified in a principle known as 
methodological naturalism. According to this principle, scientists should accept 
as a working assumption that all features of the natural world can be explained 
by material causes without recourse to purposive intelligence, mind, or 
conscious agency. 

Proponents of methodological naturalism argue that science has been so 
successful precisely because it has assiduously avoided invoking creative 
intelligence and, instead, searched out strictly material causes for previously 
mysterious features of the natural world. In the 1840s, the French philosopher 
August Comte argued that science progresses through three distinct phases. In its 
theological phase, it invokes the mysterious action of the gods to explain natural 
phenomena, whether thunderbolts or the spread of disease. In a second, more 
advanced metaphysical stage, scientific explanations refer to abstract concepts 
like Plato’s forms or Aristotle’s final causes. Comte taught that science only 
reaches maturity when it casts aside such abstractions and explains natural 
phenomena by reference to natural laws or strictly material causes or processes. 
Only in this third and final stage, he argued, can science achieve “positive” 
knowledge. 

During the late nineteenth century, scientists increasingly embraced this 
“positivistic” vision.°° Agassiz, by insisting that the Cambrian fossils pointed to 
“acts of mind”?’’ and an “intervention of an intellectual power,” stood firmly 
against this new vision. For many, his reference to the work of a transcendent 
mind merely demonstrated that he was unable to abandon an outmoded idealistic 
approach. The train of scientific progress had left Agassiz behind. 


AN OLD FOSSIL RECOVERED 


Though clearly Agassiz did reject the principle of methodological naturalism, as 
it is now named, there are problems with portraying him as a fossil of another 
age. First, Agassiz was unsurpassed in his commitment to the empirical method. 
It is Agassiz about whom the story is told of the professor instructing one of his 
students to observe a fish for three arduous days, a story iconic enough that it is 
reprinted in freshman composition textbooks. In the story, the student, Samuel 
Scudder, pulls out his hair trying to see anything new about the slimy creature, 
wondering why Professor Agassiz is torturing him with this “hideous fish.” But 
in the end Scudder breaks through to new levels of observational depth and 
precision. Mabel Robinson notes that if such teaching methods seem less 
revolutionary to contemporary readers than they did to Scudder, that’s because 
Agassiz trained an army of able young naturalists who took his method to other 
universities, and they in turn passed them on to their students, themselves future 
professors.?® 

William James, the founder of American pragmatism, extolled Agassiz’s 
commitment to empirical rigor in a letter he wrote to his father while on an 
expedition with Agassiz in 1865 to South America. In the letter the young man 
commented that he felt a “greater feeling of weight and solidity about the 
presence of this great background of special facts than about the mind of any 
other man I know,”°? a storehouse of precise data made possible by “a rapidity 
of observation, and a capacity to recognize them again and remember everything 
about them.” James would eventually enter the field of psychology, but he took 
with him the empirical approach to problem solving that Agassiz had modeled so 
impressively.*! 

As Lurie concedes, Agassiz’s stature among American scientists grew out of 
his unrivaled knowledge of geology, paleontology, ichthyology, comparative 
anatomy, and taxonomy. So passionate was Agassiz for the particulars of the 
natural world that he began organizing a system of information-sharing among 
naturalists, sailors, and missionaries around the world. He collected more than 
435 barrels of specimens, among them an extremely rare group of fossil plants.* 
In a single year, Agassiz amassed more than 91,000 specimens and identified 
close to 11,000 new species,*? making Harvard’s natural history museum 
preeminent among such museums in the world. 

He also appears to have gone to great lengths, literally and figuratively, to 


assess On the Origin of Species empirically, going so far as to make a research 
voyage retracing Darwin’s trip to the Galapagos Islands. As he explained to 
German zoologist Carl Gegenbauer, he “wanted to study the Darwin theory free 
from all external influences and former prejudices.”“4 

The idea that religious or philosophical prejudice compromised Agassiz’s 
scientific judgment raises other questions. As historian Neal Gillespie explains, 
Agassiz was “second to no man in his opposition to sectarian religious 
interference with science.”4° Moreover, Agassiz showed himself perfectly 
willing to accept natural mechanisms where before supernatural intervention had 
been the preferred explanation. Since he regarded material forces, and the laws 
of nature that described them, as the products of an underlying design plan, he 
Saw any creative work they did as deriving ultimately from a creator. For 
instance, he assumed this was the case with the development of embryos: he 
attributed their natural evolution from zygote to adult as a natural phenomenon 
and considered this no threat to his belief in a creator.*° He also readily accepted 
the notion of a naturally evolving solar system.*” He thought a skillful cosmic 
architect could work through secondary natural causes every bit as effectively as 
through direct acts of agency. The marginalia in his copy of On the Origin of 
Species suggest that he had this same attitude concerning biological evolution. 
“What is the great difference,” he wrote, “between supposing that God makes 
variable species or that he makes laws by which species vary?” 

A third problem with the official portrait of Darwin’s chief rival concerns 
Lurie’s suggestion that Agassiz was a master of particulars, but not of 
generalizing from those particulars. The historical record suggests otherwise. For 
example, Agassiz was the man who ably generalized from a wide array of 
particular clues in his work on the ice age, winning over the geological 
establishment by demonstrating how a range of facts were best explained by the 
action of retreating glaciers. 

Here a direct comparison between Darwin and Agassiz is possible. Each 
searched for an explanation of a curious geological phenomenon in the Scottish 
Highlands, the parallel roads of Glen Roy. Glen Roy is the valley of the River 
Roy and, although it’s a place of breathtaking beauty, what visitors found most 
intriguing about it over the years were its three parallel roads that wind along the 
canyon wall on either side of the river (see Fig. 1.7).4° Scottish legend held that 
they were hunting paths built for use by early Scottish kings or perhaps even for 
the mythical warrior Fingal. Scientists later argued that the roads were natural 
rather than artificial. Darwin and Agassiz were both convinced that natural 


processes were the cause, but they nevertheless arrived at different explanations. 
What was the end of the matter? In his autobiography, Darwin explained, 
“Having been deeply impressed with what I had seen of the elevation of the land 
in S. America, I attributed the parallel lines to the action of the sea; but I had to 
give up this view when Agassiz propounded his glacier-lake theory.”4° 
Subsequent investigations in the late nineteenth and early twentieth centuries 
confirmed that Agassiz’s interpretation was the correct one.?? 

Agassiz, then, was far more than just a walking encyclopedia or an insatiable 
gatherer of fossils who couldn’t see the proverbial forest for the trees. Those 
who insist otherwise can point to but one example to support their position, 
namely, his rejection of Darwin’s theory; but they cannot use that example to 
establish his general inability to interpret evidence and then turn around and use 
that supposed inability to explain his failure to accept Darwin’s theory. That is to 
argue in a circle. 

There is a far more obvious solution to the historical puzzle posed by the 
great Agassiz’s objection to Darwin’s theory: the fossils of the Cambrian strata 
do, in fact, arise abruptly in the geological record, in clear defiance of what 
Darwin’s theory would lead us to expect. In short, a genuine mystery is at hand. 
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FIGURE 1.7 
Parallel roads of Glen Roy. 


Two final considerations lend support to this view. First, as already noted, 
Darwin himself accepted the validity of Agassiz’s objection.°' As he 
acknowledged elsewhere in the Origin, “To the question why we do not find rich 
fossiliferous deposits belonging to these assumed earliest periods prior to the 
Cambrian system, I can give no satisfactory answer. ... The case at present must 
remain inexplicable; and may be truly urged as a valid argument against the 
views here entertained.”°? 

Second, Darwin’s attempt to account for the absence of the expected fossil 
ancestors of the Cambrian forms failed to address the full strength and subtlety 
of Agassiz’s objection. As Agassiz explained, the problem with Darwin’s theory 
was not just the general incompleteness of the fossil record or even a pervasive 
absence of ancestral forms of life in the fossil record. Rather, the problem, 
according to Agassiz, was the selective incompleteness of the fossil record. 

Why, he asked, does the fossil record always happen to be incomplete at the 
nodes connecting major branches of Darwin’s tree of life, but rarely—in the 
parlance of modern paleontology—at the “terminal branches” representing the 
major already known groups of organisms? These terminal branches were well 
represented (see Fig. 1.8), often stretching over many generations and millions 
of years, while the “internal branches” at the connecting nodes on Darwin’s tree 
of life were nearly always—and selectively—absent. As Agassiz explained, 
Darwin’s theory “rests partly upon the assumption that, in the succession of ages, 
just those transition types have dropped out from the geological record which 
would have proved the Darwinian conclusions had these types been 
preserved.”°° To Agassiz, it sounded like a just-so story, one that explains away 
the absence of evidence rather than genuinely explaining the evidence we have. 

Was there any easy answer to Agassiz’s argument? If so, beyond his stated 
willingness to wait for future fossil discoveries, Darwin didn’t offer one. 





FIGURE 1.8 

The vertical lines in these diagrams represent known animal phyla. The dots 
within the vertical lines represent animals from those phyla that have been found 
fossilized in different strata. The diagram on the left shows the animal tree of life 
as expected based upon Darwinian theory. The diagram on the right shows a 
simplified representation of the actual pattern of the Precambrian—Cambrian 
fossil record. Notice that fossils representing the internal branches and nodes, 
but not the terminal branches, are missing. 


AN ENDURING MYSTERY 


In the years immediately following the publication of On the Origin of Species, 
many of Agassiz’s concerns were temporarily swept aside as public and 
scientific fascination with Darwin’s ideas grew. Even so, a persistent mystery lay 
at the feet of biologists, one that subsequent generations of scientists would 
revisit and repeatedly seek to resolve. As Darwin noted, in his time, the fossils of 
the Cambrian were relatively few, and the period of the explosion only vaguely 
understood. But perhaps future scientists would come to his rescue with fresh 
discoveries. 

The story of the successive attempts to solve the Cambrian mystery stretches 
from Darwin’s time to the present, and from the Swansea Valley in southern 
Wales to remote fossil sites in southern China. In the next chapter, the work of 
detection moves from the late nineteenth century to the early twentieth, from the 
British Isles to British Columbia, and to a fossil site above the Kicking Horse 
River so astonishing that, even today, paleontologists and some of the most 
skeptical and hardened of scientific rationalists speak its name with childlike 
reverence. 


2 
The Burgess Bestiary 


Only in fiction can we expect such fine orchestration of setting and dramatic 
action. Gothic tales haunted by demons of the past have their thunderstorms and 
crumbling mansions, the existentialist novel its disorienting cityscapes, the 
romance its unattainable balconies laced with jasmine. In ordinary life, the 
Staging is usually less precise. Intricate family tragedies unravel in tidy suburban 
ranch-style homes, while enchanted romances blossom over cubicle walls. But 
the twentieth century’s most revolutionary fossil discovery was more like fiction: 
the setting was commensurate with the moment. 

Photographs taken during the summer’s expedition show a lean and balding 
man with pleasant crinkles at the corners of his eyes and a deep thought line 
slashing down between his brows; he stands precariously on rocky ascents, pick 
and hoe at hand, gazing far into the distance from a stony peak, at ease among 
the forbidding slopes and treacherous ridges. Working his way over one ridge 
and then above the tree line of the next, Charles Doolittle Walcott reached a 
place where he could see for miles. To the northwest, the crude arrowhead of 
Mount Wapta jutted skyward. Below lay Emerald Lake, its waters green from 
the mineral-rich glacial till. To the east and west, snowy peaks stretched to the 
horizon (see Figs. 2.1 and 2.2). Only the view to the northeast lacked a vista. 
Here was the homely shale of a barren ridge. Of course, as in any fairy tale, there 
lay the real prize, a hidden vista measured not in miles, but in ages. 








FIGURE 2.1 
The scenery of the Burgess Shale and surrounding area. Courtesy Corbis. 











FIGURE 2.2 
Charles Doolittle Walcott in the field (c. 1911). Courtesy Smithsonian Institution 
Archives. 


Walcott, already the director of the Smithsonian Institution, was about to 
enter the most significant phase of his professional life. More than this, he was 
about to make perhaps the most dramatic discovery in the history of 
paleontology, a rich trove of middle Cambrian—era fossils, including many 
previously unknown animal forms, preserved in exquisite detail, suggesting an 
event of greater suddenness than had been known even in Darwin’s time and 
detailing a greater diversity of biological form and architecture than had hitherto 
been imagined. 

Where did this wealth of biological form come from, and why, again, did it 
seem to arise so suddenly during the Cambrian period? Walcott was the first to 
explore the Burgess Shale, and he would be the first to suggest an answer to the 


questions it raised. 


THE BESTIARY 


Among paleontologists, the fateful clue that led to the Burgess Shale’s discovery 
is the stuff of legend. Paleontologist Stephen Jay Gould considered it to have 
been rendered best in an obituary of Charles Walcott written by Walcott’s former 
research assistant, Charles Schuchert: 


One of the most striking of Walcott’s faunal discoveries came at the end of 
the field season of 1909, when Mrs. Walcott’s horse slid on going down the 
trail and turned up a slab that at once attracted her husband’s attention. 
Here was a great treasure—wholly strange Crustacea of Middle Cambrian 
time—but where in the mountain was the mother rock from which the slab 
had come? Snow was even then falling, and the solving of the riddle had 
to be left to another season, but next year the Walcotts were back again on 
Mount Wapta, and eventually the slab was traced to a layer of shale—later 
called the Burgess Shale—3000 feet above the town of Field.‘ 


Gould quotes the legend to celebrate its archetypal appeal even as he debunks 
it: “Consider the primal character of this tale—the lucky break provided by the 
slipping horse, . . . the greatest discovery at the very last minute of a field season 
(with falling snow and darkness heightening the drama of finality), the anxious 
wait through a winter of discontent, the triumphant return and careful, 
methodical tracing of errant block to mother lode.”* A compelling story, Gould 
concludes, but pure fiction. Walcott’s own diaries reveal that his team had plenty 
of time to begin excavating the site that very summer amid cooperative weather 
and even warm nights. As for their return the following summer, locating the 
mother lode was apparently the work of a single day rather than a full week, a 
conclusion Gould drew from both Walcott’s diaries and his knowledge of 
Walcott’s expertise as a geologist.° 

The motifs of the lucky break, the frustrating delay, and the final and 
fortuitous triumph will resurface later (see Chapter 7) as a tall tale of Gould’s 
own, but for now consider only the scientific community’s weakness for staging 
the Burgess discovery with various fictional props, as if the stunning scenery 
around it were not setting enough. This weakness for theater is understandable, 
considering what Walcott and later investigators found there. Over the next 
several years, Walcott’s team alone collected more than 65,000 specimens, many 


of them astonishingly well preserved, some so bizarre that paleontologists would 
cast about for more than half a century for the proper categories in which to 
contain them. 

Consider just one odd couple from Walcott’s quarry, Marrella and 
Hallucigenia. Marrella, also called a lace crab, is an unusual form. Walcott 
described it as a type of trilobite, but later studies by Cambridge paleontologist 
Harry Whittington classified it not as a trilobite, nor a chelicerate (the subgroup 
of arthropods that includes spiders), and not even as a crustacean, but rather as a 
fundamentally distinct form of arthropod.* The creature is divided into twenty- 
six segments, each with a jointed leg for walking and a feathery gill branch for 
swimming. Its head shield has two long pairs of spikes directed backwards, and 
the underside of the head features two pairs of antennae. One is short and stout, 
the other long and sweeping (see Fig. 2.3). 

Hallucigenia belongs to a genus and family of one. It has a rounded mass at 
one end (possibly the head) connected to a cylinder-shaped trunk sporting seven 
pairs of spines projecting upward and to either side, each of them almost as long 
as the trunk itself (see Fig. 2.4). On the underside of the creature are seven pairs 
of limbs, each corresponding in position to one of the pairs of spines on the 
back, though with the tentacle farthest back offset. The underbelly also features 
three pairs of shorter tentacles before the trunk tapers and curves upward in what 
was probably a flexible extension from the body. Each of the larger tentacles 
appears to have a hollow tube connected to the gut and a pincer at the tip. This 
ancient creature was so peculiar that paleontologists feigned disbelief at what 
they saw, giving it its memorable name. 








FIGURE 2.3 

Figure 2.3a (left): Artist rendering of Marrella splendens. Figure 2.3b (right): 
Photograph of Marrella splendens fossil. Courtesy Wikimedia Commons, user 
Smith609. 








FIGURE 2.4 


Figure 2.4a (top): Artist rendering of Hallucigenia sparsa. Figure 2.4b (bottom): 
Photograph of Hallucigenia sparsa fossil. Courtesy Smithsonian Institution. 


The term “Cambrian explosion” was to become common coin, because 
Walcott’s site suggested the geologically abrupt appearance of a menagerie of 
animals as various as any found in the gaudiest science fiction. During this 
explosion of fauna, representatives of about twenty of the roughly twenty-six 
total phyla present in the known fossil record made their first appearance on 
earth (see Fig. 2.5).° 

The term “phyla” (singular: “phylum”) refers to divisions in the biological 
classification system. The phyla constitute the highest (or widest) categories of 
biological classification in the animal kingdom, with each exhibiting a unique 
architecture, organizational blueprint, or structural body plan. Familiar examples 
of phyla are cnidarians (corals and jellyfish), mollusks (squid and clams), 


echinoderms (sea stars and sea urchins), arthropods (trilobites and insects), and 
the chordates, to which all vertebrates including humans belong. 

The animals within each phylum exhibit distinguishing features that enable 
taxonomists to divide and group them further into other, progressively smaller 
divisions, beginning with classes and orders, and eventually coming to families, 
genera, and individual species. The broadest and highest categories within the 
animal kingdom—such as phyla and classes—designate the major categories of 
animal life, typically designating unique body plans. Lower taxonomic 
categories—like genus and species—designate smaller degrees of difference 
among organisms that typically exemplify similar overall ways of organizing 
their body parts and structures. 

Throughout the book I will use these conventional categories of 
classification, as do most Cambrian paleontologists. Nevertheless, I am aware 
that some paleontologists and systematists (experts in classification) today prefer 
“phylogenetic classification,’ a method that often uses a “rank-free” 
classification scheme.® Advocates of modern phylogenetic classification argue 
that the traditional classification system lacks objective criteria by which to 
decide whether a certain group of organisms should be assigned a particular rank 
of, for example, phylum or class or order.’ Proponents of rank-free classification 
attempt to eliminate subjectivity in classification (and ranking) by grouping 
together animals that are thought, based upon studies of similar molecules in 
different groups, to share a common ancestor. This method of classification 
treats groups that emerge at roughly the same time on the tree of life as 
equivalent. Nevertheless, even proponents of phylogenetic classification often 
use the conventional taxonomic categories in their technical discussions of 
specific organisms because of their common scientific usage. So despite my own 
sympathy with some of the concerns of rank-free advocates (see below), I have 
chosen to do the same. 
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FIGURE 2.5 

Figure 2.5a (top): Chart showing when representatives of the different animal 
phyla first appeared in the fossil record. According to Darwinian theory, 
differences in biological form should increase gradually, steadily increasing the 
number of distinct body plans and phyla, over time. References for first 
appearances are found in note 5 of this chapter. Figure 2.5b (bottom, left) 
expresses that expectation graphically, showing the number of new phyla 
increasing steadily as members of one phylum diversify and give rise to new 
phyla. Figure 2.5c (bottom, right) shows the actual pattern of first appearance 
showing a spike in the number of phyla that first appear in the Cambrian, 
followed by either few or no new phyla arising in subsequent periods of 
geological history. 


In any case, it’s worth noting that using a rank-free classification system does 
not minimize the mystery of the Cambrian explosion. The Cambrian explosion 


presents a puzzle for evolutionary biologists, not just because of the number of 
phyla that arise, but rather because of the number of unique animal forms and 
structures that arise (as measured, perhaps, by the number of phyla)—however 
biologists decide to classify them. Thus, whether scientists decide to use newer 
rank-free classification schemes or older, more conventional, Linnaean 
categories, the “evolutionary novelties”—that is, the new anatomical structures 
and modes of organization—that arise suddenly with the Cambrian animals 
remain as facts of the fossil record, requiring explanation. (For an expanded 
technical discussion of these issues, go to this endnote.)® 

One especially dramatic fact of the Cambrian explosion is the first 
appearance of many novel marine invertebrate animals (representatives of 
separate invertebrate? phyla, subphyla, and classes in the traditional 
classification scheme). Some of these animals have mineralized exoskeletons, 
including those representing phyla, such as echinoderms, brachiopods, and 
arthropods, and each represent clearly distinct and novel body plans. Further, 
these are just three of dozens of novel body plans exemplified by the Burgess 
animals—animals in which both soft and hard parts are well preserved (see Fig. 
2.6). 
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FIGURE 2.6 
Representatives of some of the major animal groups that first appeared in the 
sedimentary rock record during the Cambrian period. 


The variety in the Burgess Shale was so extreme it took several decades for 
paleontologists to grasp it fully. Walcott, for instance, attempted to fit all of the 


new forms into existing phyla. However, even in the midst of this attempt, he 
realized that this revolutionary quarry posed a problem more fundamental than a 
need to tidy up the existing taxonomy. He had met Louis Agassiz at a young age, 
having sold him some of his first fossils, and later described him as “a guide in 
whom I could trust and follow,” one in whose work “I find this tribute to the 
Great Mind that created the objects of his study.”'° But in the great debate 
between Agassiz and Darwin, Walcott sided with the Englishman. Thus, the 
Burgess Shale struck Walcott as not only fascinating, but puzzling. 


A PUZZLING PATTERN 


Over the years, as paleontologists have reflected on the overall pattern of the 
Precambrian—Cambrian fossil record in light of Walcott’s discoveries, they too 
have noted several features of the Cambrian explosion that are unexpected from 
a Darwinian point of view!! in particular: (1) the sudden appearance of 
Cambrian animal forms; (2) an absence of transitional intermediate fossils 
connecting the Cambrian animals to simpler Precambrian forms; (3) a startling 
array of completely novel animal forms with novel body plans; and (4) a pattern 
in which radical differences in form in the fossil record arise before more minor, 
small-scale diversification and variations. This pattern turns on its head the 
Darwinian expectation of small incremental change only gradually resulting in 
larger and larger differences in form. 


THE MISSING TREE 


Figures 2.7 and 2.8 illustrate the difficulty posed by the first two of these 
features: sudden appearance and missing intermediates. These diagrams graph 
morphological change over time. The first shows the Darwinian expectation that 
changes in morphology should arise only as tiny changes accumulate. This 
Darwinian commitment to gradual change through microevolutionary variations 
produces the classic representation of evolutionary history as a branching tree. 

Now compare this branching tree pattern with the pattern in the fossil record. 
The bottom part of Figure 2.7 and Figure 2.8 show that the Precambrian strata do 
not document the expected transitional intermediates between Cambrian and 
Precambrian fauna. Instead, the Precambrian—Cambrian fossil record, especially 
in light of the Burgess Shale after Walcott, points to the geologically sudden 
appearance of complex and novel body plans. 
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FIGURE 2.7 

The origin of animals. Darwinian theory (top) predicts gradual evolutionary 
change in contrast to the fossil evidence (bottom), which shows the abrupt 
appearance of the major animal groups. 
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FIGURE 2.8 

According to Darwinian theory, the strata beneath the Cambrian rocks should 
evidence many ancestral and intermediate forms. Such forms have not been 
found for the vast majority of animal phyla. These anticipated but missing forms 
are represented by the gray circles. Lines and dark circles depict fossilized 
representatives of phyla that have been found. 


Of course, the fossil record does show an overall increase in the complexity 
of organisms from Precambrian to Cambrian times, as Darwin expected. But the 
problem posed by the Burgess Shale is not the increase in complexity, but the 
sudden quantum leap in complexity. The jump from the simpler Precambrian 
organisms (further explored in the next chapters) to the radically different 
Cambrian forms appears to occur far too suddenly to be readily explained by the 
gradual activity of natural selection and random variations. Neither the Burgess 
Shale nor any other series of sedimentary strata known in Walcott’s day recorded 
a pattern of novel body plans arising gradually from a sequence of intermediates. 
Instead, completely unique organisms such as the bizarre arthropod Opabinia 
(see Fig. 2.9)—with its fifteen articulated body segments, twenty-eight gills, 
thirty flipper-like swimming lobes, long trunk-like proboscis, intricate nervous 
system, and five separate eyes!*—appear fully formed in the Cambrian strata 
along with representatives of other fundamentally different body plans and 
designs of equal complexity. 

Darwin, as we know, regarded the sudden appearance of the Cambrian 
animals as a significant challenge to his theory.!? Where natural selection had to 
bridge yawning chasms from relatively simple life-forms to exquisitely complex 
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creatures, it would require great expanses of time. 








FIGURE 2.9 
Figure 2.9a (top): Artist rendering of Opabinia. Figure 2.9b (bottom): 
Photograph of Opabinia fossil. 


Darwin’s recognition!” of this constraint was prescient. Geologists in his day 
employed relative dating methods. They did not have modern radiometric 
methods for determining the “absolute” ages of rocks. For this reason, they did 
not yet fully understand how long it would have taken to accumulate the great 
columns of sedimentary rock and, thus, the great expanses of time that were 
available to the evolutionary process. Neither had scientists yet discovered the 
sophisticated inner workings of the cell, and the information-rich structures 
(DNA, RNA, and proteins) that had to be significantly altered to achieve even 
modest evolutionary changes. Nevertheless, Darwin was able—based upon what 
he knew of the complexity of organisms and his own understanding of how the 
mechanism of natural selection must operate—to deduce that descent with 
modification required time, and lots of it. 

Recalling the context of Darwin’s original argument reveals why. In the 
Origin, he sought to counter the famous watch-to-watchmaker design argument 
offered by theologian William Paley. Paley had argued that just as complex 
structures such as watches necessarily issue from intelligent watchmakers, the 
complex structures in living organisms must likewise owe their origin to a 


designing intelligence. With natural selection, Darwin proposed a purely natural 
mechanism for constructing the complex organs and structures (such as eyes) 
present in many forms of life. His mechanism of natural selection worked by 
constructing such systems one tiny step at a time, discarding the harmful 
variations and seizing upon the rare improvement. If evolution progressed by 
“whole watches”—that is, by entire anatomical systems like the trilobite’s eye— 
then biology would have fallen back to the old absurdity of imagining that a 
watch could fall together purely at random and all at once. Thus, unless 
Darwin’s evolutionary mechanism worked gradually by preserving the tiniest of 
random changes over many millions of years, it didn’t work at all. 


MORE MISSING LINKS 


Two other features of the Cambrian explosion revealed in the Burgess Shale, 
features (3) and (4) described earlier, not only confirmed the reality of the 
Cambrian mystery, but broadened and deepened it—and at just the time when 
paleontologists were looking to resolve the mystery with new fossil discoveries. 

First, the great profusion of completely novel forms of life in the Burgess 
assemblage (feature 3) demanded even more transitional forms than had 
previously been thought missing. Each new and exotic Cambrian creature—the 
anomalocarids (see Fig. 2.10), Marrella, Opabinia, and the bizarre and 
appropriately named Hallucigenia—for which there were again no obvious 
ancestral forms in the lower strata, required its own series of transitional 
ancestors. But where were they? 








FIGURE 2.10 
Figure 2.10a (top): Artist rendering of Anomalocaris. Figure 2.10b (bottom): 
Photograph of Anomalocaris fossil. Courtesy J. Y. Chen. 


Darwin had hoped that later fossil discoveries would eventually eliminate 
what he regarded as the one outstanding anomaly associated with his theory. 
Walcott’s discovery was not that discovery. Not only did the Burgess Shale fail 
to reveal the expected ancestral precursors of the known Cambrian animal forms, 


but it revealed a motley crew of previously unknown animal forms and body 
plans that now demanded their own lengthy chain of evolutionary precursors, 


only complicating the task of explaining the Cambrian explosion in Darwinian 
terms. 


ORDERS FROM THE TOP 


The Burgess Shale raised an additional difficulty (feature 4, discussed earlier), 
though not one that Walcott recognized during his lifetime. Instead, its 
exposition would await a later generation of Cambrian experts, particularly 
Stephen Jay Gould. Darwin’s theory implied that as new animal forms first 
began to emerge from a common ancestor, they would at first be quite similar to 
each other, and that large differences in the forms of life—what paleontologists 
call disparity—would only emerge much later as the result of the accumulation 
of many incremental changes. In its technical sense, disparity refers to the major 
differences in form that separate the higher-level taxonomic categories such as 
phyla, classes, and orders. In contrast, the term diversity refers to minor 
differences among organisms classified as different genera or species. Put 
another way, disparity refers to life’s basic themes; diversity refers to the 
variations on those themes. The more body plans in a fossil assembly, the greater 
the disparity. And the animal forms preserved in the Burgess Shale display 
considerable disparity. Further, the large differences in form between the first 
animals appeared suddenly in the Burgess Shale, and the appearance of such 
disparity arose before, not after, the diversification of many representatives of 
lower taxonomic categories (such as species or genera) within each higher 
category, designating a new body plan. 

The site of the Burgess Shale and its setting nicely illustrates the difference 
between diversity and disparity. Walcott’s celebrated quarry is tucked away in 
the Canadian Rockies near the Continental Divide. Reaching it involves a six- 
mile hike through the picturesque scenery of Yoho National Park—Takakkaw 
Falls, Emerald Lake, and glaciers and glacier-cut mountain peaks thrusting into 
view at almost every turn. In this ecologically diverse setting, hikers have a 
chance of spotting squirrels, marmots, deer, moose, elk, wolves, and mountain 
goats. Rare sightings might include a grizzly bear or Canadian lynx, while alert 
birdwatchers might glimpse a horned lark, a white-tailed ptarmigan, the rare 
water pipit, or a gray-crowned rosy finch; an eagle, hawk, or grassland falcon; 
dippers, jays, migrating warblers, or harlequin ducks.‘© 

As richly various as this array of animals is, all of them come from a single 
phylum, Chordata—and even from a single subphylum, Vertebrata. Imagine 
hiking to the quarry to excavate it and, on the hike, being lucky enough to spot 
every one of these animals along the way. After having feasted your eyes on 


such animal variety, when you arrive at Walcott’s quarry, it yields not merely 
dozens of fossilized species from a single subphylum, but wildly disparate 
creatures from dozens of phyla. 

According to Darwin’s theory, the differences in form, or “morphological 
distance,” between evolving organisms should increase gradually over time as 
small-scale variations accumulate by natural selection to produce increasingly 
complex forms and structures (including, eventually, new body plans). In other 
words, one would expect small-scale differences or diversity among species to 
precede large-scale morphological disparity among phyla. As the former Oxford 
University neo-Darwinian biologist Richard Dawkins puts it, “What had been 
distinct species within one genus become, in the fullness of time, distinct genera 
within one family. Later, families will be found to have diverged to the point 
where taxonomists (specialists in classification) prefer to call them orders, then 
classes, then phyla.”!’ 

Darwin himself made this point in On the Origin of Species. Explaining his 
famous tree diagram (see Fig. 2.11a), he noted that it illustrated more than just 
the theory of universal common descent. The tree diagram also illustrated how 
higher taxa should emerge from lower taxa by the accumulation of numerous 
slight variations. He said, “The diagram illustrates the steps by which small 
differences distinguishing varieties are increased into larger differences 
distinguishing species.”!® He went on to assert that the process of modification 
by natural selection would eventually move beyond the formation of species and 
genera to form “two distinct families, or orders, according to the amount of 
divergent modification supposed to be represented in the diagram.”!° In his 
view, this process would continue until it produced differences in form that were 
great enough that taxonomists would classify them as new classes or phyla. In 
short, diversity would precede disparity, and phyla-level differences in body plan 
would emerge only after species-, genus-, family-, order-, and class-level 
differences appeared. 

The actual pattern in the fossil record, however, contradicts this expectation 
(compare Fig. 2.12 to Fig. 2.11b). Instead of more and more species eventually 
leading to more genera, leading to more families, orders, classes, and phyla, the 
fossil record shows representatives of separate phyla appearing first followed by 
lower-level diversification on those basic themes. 

This is nowhere more dramatically apparent than in the Cambrian period 
explains Roger Lewin in the journal Science: “Several possible patterns exist for 
the establishment of higher taxa, the two most obvious of which are the bottom- 


up and the top-down approaches. In the first, evolutionary novelties emerge, bit 
by bit. The Cambrian explosion appears to conform to the second pattern, the 
top-down effect.”*° Or as paleontologists Douglas Erwin, James Valentine, and 
Jack Sepkoski note in their study of skeletonized marine invertebrates: “The 
fossil record suggests that the major pulse of diversification of phyla occurs 
before that of classes, classes before that of orders, orders before that of families. 
... The higher taxa do not seem to have diverged through an accumulation of 
lower taxa.”*! In other words, instead of a proliferation of species and other 
representatives of lower-level taxa occurring first and then building to the 
disparity of higher taxa, the highest taxonomic differences such as those between 
phyla and classes appear first (instantiated by relatively few species-level 
representatives). Only later, in more recent strata, does the fossil record 
document a proliferation of representatives of lower taxa: different orders, 
families, genera, and so on. Yet we would not expect the neo-Darwinian 
mechanism of natural selection acting on random genetic mutations to produce 
the top-down pattern that we observe in the history of life following the 
Cambrian explosion. 
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FIGURE 2.11 
Figure 2.11a (top): Darwin’s theory of common descent illustrated here with his 


famous branching tree of life diagram reproduced from the Origin of Species, 
1859. Figure 2.11b (bottom): Growth of the tree of life over time in the manner 
envisioned by Darwin with new species giving rise to new genera and families, 
eventually giving rise to new orders, classes, and phyla (these higher taxonomic 
categories not depicted). 


Of course, advocates of modern phylogenetic classification, with their “rank- 
free” approach, don’t describe this phenomenon as a “top-down” pattern, 
because their system of classification dispenses with taxonomic ranks and 
hierarchies. In their system, there is no “top” and no “down.” Nevertheless, 
advocates of phylogenetic classification do acknowledge that different 
combinations of “character” states (characteristics or features of organisms) can 
mark either bigger or smaller morphological differences between clades (closely 


related groups of organisms that presumably share a common ancestor). And 
some leading advocates of phylogenetic classification have noted that the fossil 
record exhibits a pattern in which a few character traits marking large 
morphological differences between clades arise first, followed later in each clade 
by the addition of other combinations of characters that mark smaller differences 
within those clades. Larger differences between clades arise first, smaller 
differences within them arose later—the themes precede the variations. 
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FIGURE 2.12 
The top-down pattern of appearance in the fossil record: disparity precedes 
diversity. 


The founder of the modern phylogenetic classification, Willi Hennig, for 
example, noted that once particular groups arise, the range of allowable 
variability within those groups narrows. In his classic work Phylogenetic 
Systematics, Hennig quoted another paleontologist approvingly who observed: 
“The breadth of evolution of successive groups shows a distinct narrowing, since 
the basic divergences of organization became progressively smaller. The type of 
mammals is more uniform and closed than that of the reptiles, which in turn is 
unquestionably uniform compared to that of the Amphibia-Stegocephalia.” 
Hennig goes on to explain that “the same phenomenon is repeated in every 
systematic unit of higher or lower order.” 

Yet, on a Darwinian view, small-scale variations and differences should arise 
first, gradually giving rise to larger-scale differences in form—just the opposite 
of the pattern evident in the fossil record. Thus, the discovery, and later analysis, 


of the Burgess revealed another puzzling feature of the fossil record from a 
Darwinian point of view, regardless of which system of classification 
paleontologists prefer to use. Indeed, Walcott’s discovery turned Darwin’s 
anticipated bottom-up—or small changes first, big changes later—pattern on its 
head. 


FIRST IMPRESSIONS 


The extraordinary conditions at work in the preservation of the Burgess fauna 
helped to reveal the extent of the rich diversity (and disparity) of form present in 
the Cambrian period. On shale of a very fine grain, the Burgess fossils look like 
lithographic pictures, dark on light (see color insert plates 15 and 16). Even the 
soft parts like the gills and guts are sometimes preserved. This is not the norm in 
the world of paleontology. Usually soft tissues decay before they can be 
fossilized, leaving behind only harder parts, such as bone, teeth, and shells, to be 
preserved. The Burgess event that captured the Cambrian fauna for future 
discovery was different. Although it took the lives of untold Cambrian animals, 
it did so with an exquisite delicacy that preserved soft tissue. 

Visualizing how this occurred will drive home why the conditions were so 
unusual. All of the fossilized animals of the Burgess Shale were sea creatures 
living near an enormous carbonate reef that later was thrust upward by plate 
tectonic activity to form what is now called the Cathedral Escarpment. Long 
after the sea creatures of the Burgess Shale were entombed, these tectonic forces 
drove the fossils upward from the seafloor carrying them many miles eastward 
along faults, at the same time building the mountains that Walcott would climb 
millions of years later. 

Thanks to this tectonic movement of earth’s major plates, the continents are 
now located in very different places than they were millions of years ago. At the 
time these Cambrian creatures were alive, the land masses that would later form 
North America lay on the equator. 

Plate tectonic activity explains why a trove of sea creatures were found 
fossilized in the mountains of Yoho National Park rather than along a seafloor 
somewhere. But there’s still the question of why so many different types of 
marine invertebrates, including soft-bodied ones, were so unusually well 
preserved. Paleontologists think they know the answer. They think the marine 
animals that were later fossilized in the Burgess Shale lived near the bottom of 
an ancient sea in front of an underwater cliff or escarpment. Due to tectonic 
activity, blocks at the edge of this underwater cliff began to break off. These 
blocks slumped, creating underwater mudflows in their wake. These slumps and 
flows transported the Burgess animals several kilometers into deeper waters 
where they were buried in such a way as to leave them not only undamaged, but 
also protected from scavengers and bacteria. Very probably, the mudflows were 


highly turbulent, for paleontologists found the creatures dumped and preserved 
in a variety of angles in relation to the bedding. The speed and pressure of these 
mudflows quickly produced a preservation-friendly, oxygen-free environment. 
Then the turbulent and muddy currents pressed fine silt and clay into the 
crevices of the bodies at just the right consistency and pressure to fossilize them 
without tearing their delicate appendages, an ideal set of circumstances for 
ensuring later observation by future paleontologists.~° 

Due in part to the unusual circumstances under which these fossils were 
preserved, there’s now little doubt about the unparalleled disparity of the 
Cambrian fauna. Based on available evidence from the Burgess Shale and other 
sites around the world, the Cambrian period witnessed the appearance of, 
arguably, more disparate body plans than ever before or since. And this disparity 
arose at a most unexpected time, assuming Darwinian theory, namely, right at the 
dawn of animal life. 


THE ARTIFACT HYPOTHESIS 


Walcott grasped these difficulties, and had a deep enough professional 
commitment to Darwinism, to search for a solution. He realized that the 
Precambrian fossil record could, in principle, assist in explaining the pattern of 
the Cambrian fossil record. The discovery of a rich Precambrian fossil history 
detailing variations accumulating little by little would serve to cast the pattern of 
the Burgess in a different light. Yet the Precambrian strata of his day showed no 
signs of providing any obvious transitional forms, much less a well-articulated 
bottom-up pattern of animals representing lower taxa proliferating into forms 
exemplifying higher and higher taxonomic categories. Nevertheless, an idea 
occurred to Walcott that gave him fresh hope. 

Perhaps his awareness of the dramatic ways that the surface of the earth had 
changed over geologic time, making preservation of the Burgess fauna itself 
possible, inspired him. Finding marine animals so high above sea level no doubt 
made Walcott acutely aware of the way in which continents and seas had 
changed locations over the course of geological time. And so, Walcott, ever the 
geologist, proposed an ingenious geological solution to the biological problem of 
the origin of animal life. He noted that the Precambrian period was a period of 
dramatic continental uplift. He then suggested that the ancestors of the trilobites 
first evolved at a time when the Precambrian seas had receded from the 
landmasses. Then, at the beginning of the Cambrian, the seas again rose, 
covering the continents and depositing recently evolved trilobites. Thus, 
according to Walcott, ancestral precursors to the trilobites and other distinctive 
Cambrian forms had existed, but they were not fossilized in sediments that 
would later be elevated above sea level until early in the Cambrian; instead, 
before the Cambrian, during a period when sea levels were lower, trilobites and 
their ancestral forms were being deposited offshore in what are now only deep- 
sea sediments.** Walcott named this cryptic period of time in which trilobites 
and other animals were rapidly evolving offshore as the “Lipalian interval.” (The 
term “Lipalian” is derived from the Greek word for lost.) In this view, the abrupt 
appearance of the Cambrian body plans in the geological column was merely an 
“artifact” of incomplete sampling of the fossil record and, indeed, the inability to 
access the undersea sedimentary layers where the ancestors of the Cambrian 
fauna presumably lay encased. In short, the transgression and regression of 
ancient seas made the ancestral precursors of the Cambrian fauna inaccessible to 


discovery. 

His artifact hypothesis (also known as the “Lipalian interval” hypothesis) was 
a distinct advance over Darwin’s unadorned claim that the fossil ancestors of the 
Cambrian animals had not yet been discovered. Walcott’s hypothesis had the 
advantage of accounting for the sudden appearance of the trilobites and the 
absence of ancestral and transitional forms by reference to known geological 
processes. It also could be tested, at least once offshore drilling technology 
advanced to allow for the sampling of the buried offshore sedimentary rocks. 

Although Walcott conceded that his hypothesis was essentially a negative 
argument that attempted to explain away the absence of evidence, he insisted 
that it was a sensible inference from his broad sampling of the paleontological 
data. “I fully realize that the conclusions above outlined are based primarily on 
the absence of a marine fauna in Algonkian [Precambrian] rocks,” he wrote, “but 
until such is discovered I know of no more probable explanation of the abrupt 
appearance of the Cambrian fauna than that I have presented.”° 


LUMPING AND SPLITTING 


Walcott used another strategy for squaring the Burgess Shale with Darwin’s 
theory of evolution. Taxonomists, tasked with identifying and naming distinct 
groups of life-forms, have been divided into two types: “lumpers” and 
“splitters.” “Lumpers” tend to group disparate organisms together in the same 
large classificatory categories and then make distinctions between them at lower 
taxonomic levels. “Splitters” tend to separate similar organisms into numerous 
higher taxonomic divisions. Walcott favored lumping, and his doing so with the 
Burgess fossils seemingly minimized the difficulties associated with the sudden 
proliferation of so many new Cambrian forms. 

On his return to the Smithsonian, he placed all of the exotic forms of the 
Burgess into modern phyla. One of his efforts at lumping placed Marrella 
splendens not only in the same phylum, but also in the same class (Trilobita) as 
the trilobites, despite obvious morphological differences. He justified this 
classification by arguing that the organism foreshadowed the trilobite (compare 
Figs. 1.4 and 2.3). Gould later criticized Walcott’s method of classification as 
“shoehorning.” He noted that even one of Walcott’s fellow lumpers, Yale 
paleontologist Charles Schuchert, called the classification of Marrella into 
question.*° Gould also noted that Walcott used this strategy to minimize the 
challenge posed by the morphological disparity of the Burgess forms.” 

Some paleontologists today reject Gould’s criticism of Walcott’s inclusion of 
so many Burgess animal forms into existing taxonomic categories.Nevertheless, 
few paleontologists think Walcott’s use of lumping explained away the 
Cambrian explosion. Most, for example, classify Marrella splendens within an 
existing modern phylum, namely, Arthropoda, even if they also classify it within 
a new and separate class, Marrellomorpha. Yet, whether Marrella, for example, 
falls within a novel phylum or class, matters less than explaining why so many 
clearly novel forms, and the novel structures these forms exhibit, first arose with 
such apparent suddenness. 


RESOLUTION—FOR A TIME 


Walcott thought that he had solved the mystery of the Cambrian explosion, as 
did many other Darwinists who gratefully adopted both his taxonomy and his 
version of the artifact hypothesis. And since Walcott’s approach held out hope of 


one day uncovering evidence of a Precambrian trunk for the animal phyla along 
with its primary limbs, adherents could not be accused of moving the 
paleontological case for Darwinism into the realm of untestable dogma. They 
had only to wait for the technologies of seafloor drilling to emerge and hope that 
nature had seen fit, unmolested under the ocean deeps, to leave concrete 
evidence of the gradual emergence of the major Cambrian body plans. 

Walcott’s theoretical accomplishment was no mean feat. His discovery of the 
Burgess Shale was like a defense attorney with absolute faith in his client 
stumbling upon a room stuffed with clues that would seem to discredit him. 
Through his grouping of disparate body types into existing phyla and his 
ingenious version of the artifact hypothesis, Walcott had found an elegant way to 
explain all this seemingly uncooperative evidence in a Darwinian way. 

In defending Walcott for overlooking significant features of the Burgess 
fossils, Gould points out that Walcott’s multiple and growing administrative 
demands hardly left time to revisit the foundational categories of animal 
taxonomy. How much less, then, was Walcott likely to revisit the most 
fundamental assumption of all—the assumption that animals originated 
gradually in a Darwinian way as the result of natural selection acting on small 
incremental variations? Consideration of that possibility would come decades 
later, only after Walcott’s version of the artifact hypothesis had itself exploded. 


3 
Soft Bodies and Hard Facts 


In the spring of 2000, Discovery Institute, where I do my research, sponsored a 
lecture at the University of Washington geology department by renowned 
Chinese paleontologist J. Y. Chen (see Fig. 3.1). As the result of his role in 
excavating a new discovery of Cambrian-era fossils in southern China, Professor 
Chen’s standing in the scientific world was on the rise. The discovery, near the 
town of Chengjiang in the Kunming Province, revealed a trove of early 
Cambrian animal forms. After Time magazine mentioned the Chengjiang 
discovery in a 1995 cover story about the Cambrian explosion,! interest in the 
fossils surged. When he came to Seattle, Professor Chen had already published 
numerous scientific papers about this profusion of novel life-forms and had 
established himself as one of the foremost experts on the fossils in this unique 
geological setting. 

Not surprisingly, Chen’s visit generated considerable interest among the 
University of Washington faculty. He came bearing intriguing photographs and 
samples of the oldest and most exquisitely preserved Cambrian fossils in the 
world from an exotic site halfway around the globe, a site, moreover, that was 
now widely acknowledged to surpass even the legendary Burgess Shale as the 
most extensive and significant Cambrian-era locality. 

The fossils from the Maotianshan Shale near Chengjiang (see Fig. 3.2) had 
established an even greater variety of Cambrian body plans from an even older 
layer of Cambrian rock than those of the Burgess, and they did so with an almost 
photographic fidelity. The Chinese fossils also helped to establish that the 
Cambrian animals appeared even more explosively than previously realized. 
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FIGURE 3.1 
J. Y. Chen. 
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FIGURE 3.2 

Figure 3.2a (left) shows the Moatianshan Shale outcrop. Courtesy Illustra 
Media. Figures 3.2b and c (center and right) show a Precambrian-Cambrian 
boundary marker at the Moatianshan site. Courtesy Paul Chien. 


So there was little doubt about the significance of the discoveries that Chen 
came to report that day. What was soon in doubt, however, was Chen’s scientific 
orthodoxy. In his presentation, he highlighted the apparent contradiction between 
the Chinese fossil evidence and Darwinian orthodoxy. As a result, one professor 
in the audience asked Chen, almost as if in warning, if he wasn’t nervous about 
expressing his doubts about Darwinism so freely—especially given China’s 


reputation for suppressing dissenting opinion. I remember Chen’s wry smile as 
he answered. “In China,” he said, “we can criticize Darwin, but not the 
government. In America, you can criticize the government, but not Darwin.” 

Nevertheless, those in the audience that day soon learned that Professor Chen 
had good reasons for questioning Darwin’s picture of the history of life. As Chen 
explained, the Chinese fossils turn Darwin’s tree of life “upside down.” They 
also cast doubt on a surviving version of Charles Walcott’s artifact hypothesis, a 
crucial prop in the case for Darwinian gradualism. 


BURGESS REVISITED BY THE TIME CHARLES WALCOTT FINISHED 
HIS LAST EXCAVATION OF THE BURGESS SHALE IN 1917, HE AND HIS 
TEAM HAD COLLECTED OVER 65,000 FOSSIL SPECIMENS, ALL OF 
WHICH WERE SHIPPED TO THE NATURAL HISTORY MUSEUM AT THE 
SMITHSONIAN INSTITUTION FOR CATALOGUING. IN 1930, ANOTHER 
AMERICAN PALEONTOLOGIST, HARVARD PROFESSOR PERCY 
RAYMOND, INITIATED ANOTHER INVESTIGATION OF THE BURGESS. 
HIS SPECIMENS WERE ALSO EVENTUALLY STORED IN THE UNITED 
STATES. 


As a result of these two prominent American-led excavations, there were 
initially no collections of Burgess fossils on public display in Canada. Many 
Canadian scientists regarded this as a national embarrassment, so in the 1960s 
the Canadian Geological Survey commissioned a British team to resume digging 
at the Walcott quarry, in order to “repatriate the Burgess Shale” by keeping most 
of the newly discovered fossils on permanent display in Canada.* The team was 
led by paleontologist Harry Whittington (see Fig. 3.3), of the University of 
Cambridge, who was assisted by two of his graduate students, Simon Conway 
Morris and Derek Briggs, both of whom would eventually distinguish 
themselves as internationally renowned experts on the Burgess Shale. 
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FIGURE 3.3 
Harry Whittington. Courtesy Archives of the Museum of Comparative Zoology, 
Ernst Mayr Library, Harvard University. 


As Whittington analyzed the Cambrian fauna at the Burgess, he realized that 
Walcott had grossly underestimated the morphological disparity of this group of 
animals. Many of the creatures in the assemblage featured unique body designs, 
unique anatomical structures, or both.* Opabinia, with its five eyes, fifteen 
distinct body segments, and a claw at the end of a long proboscis, exemplified 
the unique forms on display in the Burgess. But so did Hallucigenia, Wiwaxia, 
Nectocaris, and many other Burgess animals. To this day, paleontologists 
describing Nectocaris, for example, can’t decide whether it more closely 
resembles an arthropod, a chordate, or a cephalopod (a class of mollusk; see Fig. 
3.4). 

Whittington found that grouping such forms within well-established 
taxonomic categories, even higher taxonomic categories such as the class or 
phylum, strained the limits of these classifications. Even many of those animals 
that fell easily into existing phyla represented clearly unique subphyla or classes 
of organisms. Anomalocaris (literally, “abnormal shrimp”) and Marrella, for 
example, both had hard exoskeletons and clearly represent either arthropods or 
creatures closely related to them. Yet each of these animals possessed many 
distinct anatomical parts and exemplified different ways of organizing these 


parts, thus clearly distinguishing themselves from better-known arthropods such 
as the previous staple of Cambrian paleontological studies, the trilobite. 
Whittington, a trilobite expert, understood this as well as anyone. In 1971, he 

published the first comprehensive taxonomic review of the Burgess biota. In his 
review, he broke decisively with Walcott’s previous attempt to lump all 
Cambrian forms into a few preexisting taxonomic categories. 
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FIGURE 3.4 
Figure 3.4a (left): Artist rendering of Nectocaris. Figure 3.4b (middle, right): 
Photograph of Nectocaris fossil. Reprinted by permission from Macmillan 
Publishers Ltd.: Nature, Martin R. Smith and Jean-Bernard Caron, “Primitive 
Soft-Bodied Cephalopods from the Cambrian,” Nature, 465 (May 27, 2010): 
469-72. Copyright 2010. 


In so doing, he reemphasized the morphological disparity present in the 
Burgess animal biota and, in the process, deprived evolutionary biologists of one 
part of Walcott’s two-part strategy for minimizing the Cambrian problem. By 
lumping all Burgess animals into existing phyla and classes, Walcott had 
seemingly diminished the problem of disparity by reducing the number of novel 
phyla for which connecting intermediates were required. By recognizing the 
disparity clearly on display in the Burgess, Whittington partially undercut 
Walcott’s solution to the Cambrian mystery and highlighted what would become 
its central unsolved problem: the origin of novel biological form. 


SOUNDING THE SEAFLOOR ALTHOUGH WHITTINGTON, AND 
LATER GOULD, REJECTED WALCOTT’S EARLY ATTEMPT TO 
“SHOEHORN” ALL THE ANIMALS OF THE BURGESS SHALE INTO 
PREEXISTING TAXONOMIC CATEGORIES, MANY PALEONTOLOGISTS 
NOW ALSO REJECT STEPHEN JAY GOULD’S CHARACTERIZATION OF 
MANY BURGESS SHALE CREATURES AS BEING SO EXOTIC AS TO 
DEFY AFFINITY IN CLASSIFICATION WITH ANY MODERN GROUPS.4 
MANY OF THESE PALEONTOLOGISTS WOULD ALSO RECOGNIZE 
FEWER TOTAL PHYLA FIRST APPEARING IN THE CAMBRIAN THAN 
GOULD DID, AND PERHAPS EVEN AS FEW AS WALCOTT DID. AS 
DISCUSSED IN THE PREVIOUS CHAPTER, STILL OTHER 
PALEONTOLOGISTS NOW FAVOR “RANK-FREE” APPROACHES TO 
CLASSIFICATION. 


Regardless, most paleontologists recognize that the Burgess Shale attests to 
an extraordinary profusion of new animal forms—including many manifesting 
unique anatomical structures and arrangements of body parts. Thus, whatever 
differences of opinion exist about how to classify these animals—and any five- 
year-old child can distinguish them from each other and from all previously 
known forms of life—their origin still requires explanation. Thus, as noted, 
Walcott’s use of “lumping” did not solve the Cambrian mystery.° 

But what of the second part of Walcott’s proposal, the artifact hypothesis? To 
evaluate this hypothesis, Walcott devised a more clear-cut and less subjective 
test. Recall that Walcott argued that the ancestral precursors of the Cambrian 
animals were missing from the Precambrian fossil record because of the 
transgression and regression of seas. He posited an interval of geologic time in 
which the ancestors of the Cambrian fauna were evolving offshore in a 
Precambrian ocean and being deposited only in layers of marine sedimentary 
rock. In this hypothesis, only after the ancient ocean rose and covered the 
continent were the remains of Cambrian sea animals preserved in sediment that 
today is above sea level. 

When Walcott proposed his ingenious geological scenario, it could not yet be 
tested. But with the development of offshore drilling technology in the 1940s, 
1950s, and 1960s, oil companies began to drill through thousands of feet of 
marine sedimentary rock.® As geologists evaluated the contents of these drill 
cores, they did not find Walcott’s predicted Precambrian fossils. 

Instead, an even more fundamental problem for the hypothesis arose. At the 


time that Walcott proposed his version of the artifact hypothesis, geologists 
considered the oceanic and continental plates to be essentially stable and fixed 
with respect to one another. Mountain building, faulting, and other geologic 
processes were attributed to worldwide changes in sea level, accumulating 
troughs of sediment called geosynclines, rising mounds of igneous rocks from 
beneath the earth’s crust, and even a shrinking earth.’ 

The idea that enormous, solid plates actually move, recycling themselves 
through the plate-tectonic processes of subduction and seafloor spreading, had 
not yet been proposed. Yet modern plate tectonic theory now affirms that oceanic 
crustal material eventually plunges back into the earth and melts in a process 
known as subduction. After surface rocks melt during subduction, they form a 
new supply of molten magma. Eventually, magma from other locations deep in 
the earth wells up at mid-oceanic ridges to form new igneous rocks, in a process 
known as seafloor spreading. It follows that any oceanic sediments deposited 
atop the oceanic igneous crust will have a limited “life span” on the surface of 
the earth. Eventually, these sedimentary rocks collide with the continental 
margin, plunge deep into the upper mantle, and melt to form magma. 

As a consequence of this cycle, the maximum age of any marine sediment is 
strictly limited. And according to modern estimates, the oldest section of oceanic 
crust has existed only since the Jurassic (or about 180 million years ago®)—far 
too young to contain fossil ancestors of the trilobites. As the evidence for plate 
tectonics mounted, scientists discarded Walcott’s artifact hypothesis and Lipalian 
interval as nonstarters. Paleontologists today do not expect to find any 
Precambrian ancestors of the trilobites in oceanic sediments, since they realize 
that there are no Precambrian sediments in the ocean basins. If Precambrian 
strata are to be found anywhere, continents are the place. 


OTHER VERSIONS OF THE ARTIFACT HYPOTHESIS ALTHOUGH 
WALCOTT’S PROPOSALS TO EXPLAIN AWAY THE ABSENCE OF 
FOSSILIZED ANCESTORS OF THE CAMBRIAN ANIMALS CAME TO 
NAUGHT, OTHER VERSIONS OF THE ARTIFACT HYPOTHESIS 
CONTINUED TO CIRCULATE. THESE PROPOSALS TAKE TWO BASIC 
FORMS. SOME SCIENTISTS CLAIMED, THOUGH FOR DIFFERENT 
REASONS, THAT THE EXPECTED PRECAMBRIAN FOSSIL ANCESTORS 
HAD SIMPLY NOT YET BEEN FOUND—THAT MISSING FOSSILS WERE 
AN ARTIFACT OF THE INCOMPLETE SAMPLING OF THE FOSSIL 
RECORD. OTHERS SUGGESTED THAT PRECAMBRIAN SEDIMENTARY 


ROCKS HAD NOT PRESERVED THE MISSING FOSSILS—THAT THE 
INCOMPLETE PRESERVATION OF THE PRECAMBRIAN ANIMALS 
MEANT THE MISSING FOSSILS WERE NO LONGER THERE TO BE 

FOUND. 


Walcott rejected the idea that paleontologists simply had not looked in, or 
sampled, enough places. He noted that geologists already had extensively 
investigated “the great series of Cambrian and Precambrian strata in eastern 
North America.” Though they had looked “from Alabama to Labrador; in 
western North America [and] from Nevada and California far into Alberta and 
British Columbia, and also in China” their investigations had turned up nothing 
of significant interest.? In Walcott’s view the continents simply had not 
preserved the fossilized remains of the Cambrian ancestors. 

Before Walcott, some geologists had gone a step farther and suggested that 
all Precambrian sedimentary rocks had been destroyed via extreme heat and 
pressure, a process called “universal metamorphism.” Walcott rejected this 
hypothesis, since he himself had encountered a “great series of pre-Cambrian 
sedimentary rocks on the North American continent” among other places. Other 
geologists suggested that major bursts of evolutionary innovation occurred only 
during periods when sedimentary deposition had ceased, thus again resulting in a 
lack of fossil preservation. But, as Gould remarked of Walcott’s artifact 
hypothesis, this explanation also appeared to many scientists “forced and ad hoc 
... born of frustration, rather than the pleasure of discovery.”!° 


CONTEMPORARY VERSIONS OF THE ARTIFACT HYPOTHESIS: 
TOO SOFT OR TOO SMALL AFTER THE DEMISE OF THE 
“UNIVERSAL METAMORPHISM” IDEA, SOME PALEONTOLOGISTS 
PROPOSED SIMPLER, MORE INTUITIVELY PLAUSIBLE VERSIONS OF 
THE ARTIFACT HYPOTHESIS. THEY CLAIMED THAT THE PROPOSED 
INTERMEDIATE FORMS LEADING TO THE CAMBRIAN ANIMALS MAY 
HAVE BEEN EITHER TOO SMALL OR TOO SOFT, OR BOTH, TO HAVE 
BEEN PRESERVED. 


Developmental biologist Eric Davidson, of California Institute of Technology, 
has suggested that the transitional forms leading to the Cambrian animals were 
“microscopic forms similar to modern marine larvae” and were thus too small to 
have been reliably fossilized.'' Other evolutionary scientists, such as Gregory 


Wray, Jeffrey Levinton, and Leo Shapiro, have suggested that the ancestors of 
the Cambrian animals were not preserved, because they lacked hard parts such 
as shells and exoskeletons.!* They argue that since soft-bodied animals are 
difficult to fossilize, we shouldn’t expect to find the remains of the supposedly 
soft-bodied ancestors of the Cambrian fauna in the Precambrian fossil record. 
University of California, Berkeley, paleontologist Charles R. Marshall 
summarizes these explanations: [I]t is important to remember that we see the 
Cambrian “explosion” through the windows permitted by the fossil and 
geological records. So when talking about the Cambrian “explosion,” we are 
typically referring to the appearance of large-body (can be seen by the naked 
eye) and preservable (and therefore largely skeletonized) forms. . . . If the stem 
lineages were both small and unskeletonized, then we would not expect to see 
them in the fossil record.'° 

Though intuitively plausible, several discoveries call into question both of 
these versions of the artifact hypothesis. As for the idea that the ancestors of the 
Cambrian animals were too small to be preserved, paleontologists have known 
for some time that the cells of filament-shaped microorganisms (probably 
cyanobacteria) have been preserved in ancient Precambrian rocks. Paleobiologist 
J. William Schopf, of the University of California, Los Angeles, has reported an 
extremely ancient example of these fossils in the Warrawoona Group strata of 
western Australia. These fossilized cyanobacteria are preserved in 3.465-billion- 
year-old bedded cherts (microcrystalline sedimentary rocks).'* The same strata 
have also preserved stromatolite mats, an organic accretionary growth structure 
usually indicating the presence of bacteria, within slightly younger dolostone 
sediments of roughly 3.45 billion years in age (see Fig. 3.5).!° 











FIGURE 3.5 
Figure 3.5a (left): Photographs of Cambrian-age fossil stromatolites. Courtesy 


Wikimedia Commons, user Rygel, M. C. Figure 3.5b (right): Alternating fine-and 
coarse-layered structure of Precambrian stromatolite fossils shown in cross 
section. Courtesy American Association for the Advancement of Science, Figure 
2B, Hoffman, P., “Algal Stromatolites: Use in Stratigraphic Correlation and 
Paleocurrent Determination,” Science, 157 (September 1, 1967): 1043-45. 
Reprinted with permission from AAAS. 


These discoveries pose a problem for the idea that the Cambrian ancestors 
were too small to survive in the fossil record. The sedimentary rocks that 
preserve the fossilized cyanobacteria and single-celled algae are far older and, 
therefore, far more likely to have been destroyed by tectonic activity than those 
later sedimentary rocks that should have preserved the near-ancestors of the 
Cambrian animals. Yet these rocks, and the fossils they contain, have survived 
just fine. If paleontologists can find tiny fossilized cells in these far older and 
rarer formations, shouldn’t they also be able to find some ancestral forms of the 
Cambrian animals in younger and more abundant sedimentary rocks? Yet few 
such precursors have been found. 

There are also several reasons to question the second version of this 
hypothesis—the idea that the presumed Cambrian ancestors were too soft to be 
preserved. First, some paleontologists have questioned whether soft-bodied 
ancestral forms of the hard-bodied Cambrian animals would have even been 
anatomically viable.'° They argue that many animals representing phyla such as 
brachiopods and arthropods could not have evolved their soft parts first and then 
added shells later, since their survival depends upon their ability to protect their 
soft parts from hostile environmental forces. Instead, they argue that soft and 
hard parts had to arise together.'” As paleontologist James Valentine, of the 
University of California, Berkeley, has noted in the case of brachiopods, “The 
brachiopod Bauplan [body plan] cannot function without a durable skeleton.”!® 
Or as J. Y. Chen and his colleague Gui-Qing Zhou observe: “Animals such as 


brachiopods . . . cannot exist without a mineralized skeleton. Arthropods bear 
jointed appendages and likewise require a hard, organic or mineralized outer 
covering.” !? 


Because these animals typically require hard parts, Chen and Zhou assume 
that the ancestral forms of these animals should have been preserved somewhere 
in the Precambrian fossil record if in fact they were ever present. Thus, the 
absence of hard-bodied ancestors of these Cambrian animals in the Precambrian 
strata shows that these animals first arose in the Cambrian period. As they rather 


emphatically insist: “The observation that such fossils are absent in Precambrian 
strata proves that these phyla arose in the Cambrian.”7° 

It should be pointed out that this argument cannot be made for all Cambrian 
animal groups and, in my view, does not achieve the standing of a “proof” in any 
case. Many Cambrian phyla, including phyla characterized by mostly hard- 
shelled animals such as mollusks and echinoderms, do have soft-bodied 
representatives. The earliest known mollusk, Kimberella, for example, lacked a 
hard external shell (though it did have other hard parts).*! So, clearly, some 
mainly hard-shelled Cambrian groups could have had soft-bodied ancestors. 

It is also possible to postulate the existence of an arthropod or brachiopod 
ancestor—especially some extremely distant ancestor—lacking a hard shell. 
Soft-bodied onychophorans (velvet worms) were once proposed as ancestors of 
the arthropods, though more recent studies challenge this idea. Onychophorans 
themselves arise well after arthropods in the fossil record and cladistics analysis 
suggests that onychophorans may be a sister, rather than an ancestral, group to 
arthropods.** Even so, it’s difficult to disprove a negative: in particular, to 
foreclose the possibility that arthropods or brachiopods might have had a soft- 
bodied ancestor deep in the Precambrian. 

Nevertheless, it seems unlikely on a Darwinian view of the history of life that 
all Cambrian arthropod or brachiopod ancestors, especially the relatively recent 
ancestors of these animals, would have lacked hard parts entirely. There are 
many types of arthropods that arise suddenly in the Cambrian—trilobites, 
Marrella, Fuxianhuia protensa, Waptia, Anomalocaris—and all of these animals 
had hard exoskeletons or body parts. Moreover, the only known extant group of 
arthropods without a hard exoskeleton (the pentastomids) have a parasitic 
relationship with arthropods that do.*° Thus, surely, it seems likely that some of 
the near ancestors of the many arthropod animals that arose in the Cambrian 
would have left at least some rudimentary remains of exoskeletons in the 
Precambrian fossil record—if, in fact, such ancestral arthropods existed in the 
Precambrian and if arthropods arose in a gradual Darwinian way. 

Moreover, the arthropod exoskeleton is part of a tightly integrated anatomical 
system. Specific muscles, tissues, tendons, sensory organs—and a special 
mediating structure between the soft tissue of the animal and the exoskeleton 
called the endophragmal system—are all integrated to support the process of 
molting and exoskeletal growth and maintenance that is integral to the arthropod 
mode of existence. A best-case Darwinian scenario for the origin of such a 
system would, therefore, envision the “co-evolution” of these separate 


anatomical subsystems in a coordinated fashion, since some of these anatomical 
subsystems confer a functional advantage to the animal largely by supporting, 
and promoting, the growth and maintenance of the exoskeleton (and vice versa). 
Others would be vulnerable to damage without it. Thus, it seems unlikely that 
these interdependent subsystems would evolve independently first without an 
exoskeleton, only to have the exoskeleton arise suddenly as a kind of accretion 
atop an already integrated system of soft parts at the end of a long evolutionary 
process. 

This, again, makes it reasonable to expect that at least some rudimentary 
arthropod hard parts would have been preserved in the Precambrian if arthropods 
were present then. That such parts are unknown for all Cambrian arthropods 
(and brachiopods) in a fossil record that presumably favors hard-part 
preservation, seems at least curious. And it appears, on its face, to support the 
assertions of those Cambrian paleontologists such as Chen and Zhou who take 
the absence of any hard parts in the Precambrian record as evidence of the 
absence of those groups that typically depend on hard parts for their existence. 

In any case, advocates of the artifact hypothesis must at least explain a 
Cambrian explosion of hard body parts, if not whole Cambrian animals. As 
paleontologist George Gaylord Simpson noted in 1983, even if it’s true that 
Precambrian ancestors were not preserved simply because they lacked hard 
parts, “there is still a mystery to speculate about: Why and how did many 
animals begin to have hard parts—skeletons of sorts—with apparent suddenness 
around the beginning of the Cambrian?”* 

There is an additional, more formidable, difficulty for this version of the 
artifact hypothesis. Although the fossil record generally does not preserve soft 
body parts as frequently as hard parts, it has preserved many soft-bodied 
animals, organs, and anatomical structures from both the Cambrian and the 
Precambrian periods. 

As we Saw earlier, Precambrian sedimentary rocks in several places around 
the world have preserved fossilized colonial blue-green algae, single-celled 
algae, and cells with a nucleus (eukaryotes).*? These microorganisms were not 
only small, but they also entirely lacked hard parts. Another class of late 
Precambrian organisms called the Vendian or Ediacaran biota included the 
fossilized remains of many soft-bodied organisms, including many that may well 
have been lichens, algae, or protists (microorganisms with cells containing 
nuclei). Cambrian-era strata themselves preserve many soft-bodied creatures and 
structures. The Burgess Shale in particular preserved the soft parts of several 


types of hard-bodied Cambrian animals, such as Marrella splendens,~© 
Wiwaxia,*’ and Anomalocaris. The Burgess Shale also documents entirely soft- 
bodied representatives~® of several phyla, including: 

e Cnidaria (represented by an animal called Thaumaptilon, a feather- 
shaped colonial organism formed from smaller soft sea anemone—like 
animals)? 

e Annelida (represented by the polychaete worms Burgessochaeta and 
Canadia)°*° 

e Priapulida (represented by Ottoia, Louisella, Selkirkia—all worms with 
a distinctive proboscis)*! 

e Ctenophora (represented by Ctenorhabdotus, a gelatinous animal with a 
translucent body similar to a modern comb jelly)°? 

e Lobopodia (represented by Aysheaia and Hallucigenia, segmented soft- 
bodied animals with many legs)*° 


The Burgess also preserves soft-bodied animals of unknown affinities, such 
as Amiskwia, a gelatinous air mattress—like animal;** Eldonia, a jellyfish-like 
animal with a much more complex anatomy than a modern jellyfish;*° and the 
aforementioned, difficult to classify, Nectocaris.2° As Simon Conway Morris 
notes, “The existing [Burgess] collections represent approximately 70,000 
specimens. Of these, about 95 percent are either soft-bodied or have thin 
skeletons.”?” 


THE CHENGJIANG EXPLOSION ANY DOUBTS ABOUT THE ABILITY 
OF SEDIMENTARY ROCKS TO PRESERVE SOFT AND SMALL BODY 
PARTS WERE PERMANENTLY LAID TO REST BY A SERIES OF 
DRAMATIC FOSSIL FINDS IN SOUTHERN CHINA BEGINNING IN THE 
1980S. 


In June 1984, paleontologist Xian-Guang Hou journeyed to Kunming, in 
southern China, to prospect for fossilized samples of a bivalved arthropod called 
a bradoriid.*° The area around Kunming in the Yunnan Province was well known 
for its lower Cambrian strata and typical Cambrian-era fossils, such as bradoriids 
and trilobites, both of which were relatively easy to preserve because of their 
characteristically hard exoskeletons. In 1980, Hou had found many bradoriid 
samples in a geological formation called the Qiongzhusi Section near Kunming. 


In the summer of 1984 Hou traveled to the town of Chengjiang to look for 
bradoriids in another geological formation called the Heilinpu Formation. His 
efforts there yielded little success. As a result, he turned his attention to another 
outcrop, a sedimentary sequence now called the Maotianshan Shale. Hou’s team 
set farmworkers to digging out and scouring the mudstone blocks. His book, The 
Cambrian Fossils of Chengjiang, China, describes what happened next: At about 
three o’clock in the afternoon of Sunday July 1, a semicircular white film was 
discovered in a split slab, and was mistakenly thought to represent the valve of 
an unknown crustacean. With the realization that this . . . represented a 
previously unreported species, breaking of the rock in a search for additional 
fossils continued apace. With the find of another specimen, a 4—5 cm long 
animal with limbs preserved, it became apparent that here was nothing less than 
a soft-bodied biota.*” 

Hou remembers the Cambrian specimen vividly, for it appeared “as if it was 
alive on the wet surface of the mudstone.”*? Redoubling their efforts, the 
researchers quickly uncovered the fossilized remains of one extraordinary soft- 
bodied animal after another. Most of the fossils were preserved as flattened two- 
dimensional imprints of three-dimensional organisms, although, as Hou 
observes, “some retain a low three-dimensional relief.”*! Most important, he 
notes, “The remains of hard tissues, such as the shells of brachiopods or the 
carapaces of trilobites, are well represented in the Chengjiang fauna, but less 
robust tissues, which are usually lost through decomposition, are also beautifully 
preserved.” 

As the result of the very fine, small-grained sediments in which they were 
deposited, the Chengjiang fossils preserved anatomical details with a fidelity 
surpassing even that of the Burgess fauna.*? The Maotianshan Shale also 
preserved an even greater variety of soft-bodied animals and anatomical parts 
than the Burgess Shale had done. In the ensuing years, Hou and his closest 
colleagues, J. Y. Chen and Gui-Qing Zhou, found many excellent examples of 
well-preserved animals lacking even a keratinized exoskeleton, including soft- 
bodied members of phyla such as Cnidaria (corals and jellyfish), Ctenophora 
(comb jellies), Annelida (a type of “ringed” segmented worm), Onychophora 
(segmented worms with legs), Phoronida (a tubular, filter-feeding marine 
invertebrate), and Priapulida (another distinctive type of worm). (See Fig. 3.6.) 
They found fossils preserving the anatomical details of numerous soft tissues 
and organs such as eyes, intestines, stomachs, digestive glands, sensory organs, 


epidermes, bristles, mouths, and nerves.*° They also discovered jellyfish-like 
organisms called Eldonia, which exhibit delicate, soft body parts such as 
radiating water canals and nerve rings. Other fossils even revealed the contents 
of the guts of several animals.*° 

The discoveries near Chengjiang demonstrated beyond any reasonable doubt 
that sedimentary rocks can preserve soft-bodied fossils of great antiquity and in 
exquisite detail, thereby challenging the idea that the absence of Precambrian 
ancestors is a consequence of the fossil record’s inability to preserve soft-bodied 
animals from that period. At the same time, the sedimentary rocks near 
Chengjiang had other surprises in store. 


PRECAMBRIAN SECRETS PAUL CHIEN IS A CHINESE-AMERICAN 
MARINE BIOLOGIST WHO, AS A BOY, LEFT MAINLAND CHINA WITH 
HIS FAMILY TO ESCAPE THE 1949 COMMUNIST TAKEOVER UNDER 
MAO TSE-TUNG. EVENTUALLY, AFTER COMPLETING PH.D. STUDIES 
IN THE UNITED STATES, HE BECAME A BIOLOGY PROFESSOR AT THE 
UNIVERSITY OF SAN FRANCISCO. HE FIRST LEARNED ABOUT THE 
DISCOVERIES IN SOUTHERN CHINA AFTER READING THE TIME 
MAGAZINE COVER STORY IN 1995. THEN HE LEARNED, IRONICALLY, 
FROM A STORY IN PEOPLE’S DAILY, THE OFFICIAL NEWSPAPER OF 
THE CHINESE COMMUNIST PARTY, THAT SOME CHINESE 
PALEONTOLOGISTS THOUGHT THAT THESE DISCOVERIES 
CHALLENGED A DARWINIAN VIEW OF THE HISTORY OF LIFE. 


The great variety of the marine invertebrates present in Chengjiang fascinated 
Chien and convinced him to return to the country of his birth. After making his 
first trip in the summer of 1996, he met J. Y. Chen. As Paul Chien returned over 
several successive summers to do research of his own, he and J. Y. Chen 
continued to share their findings and compare notes. 

Upon arriving in China in 1998 for his third summer of research, Paul Chien 
learned that Chen had discovered a fossil of an adult sponge in the late 
Precambrian rocks of a sedimentary formation called the Doushantuo 
Phosphorite—a formation that lies beneath the Maotianshan Shale. As the two 
scientists examined the sediments that encased Chen’s fossil sponge, they made 
a discovery that would doom the most popular remaining version of the artifact 
hypothesis. 


Figure 3.6c: 
The shrimp- 
like arthropod, 
Waptia. 





Figure 3.6d: 
A priapulid 
worm with 
its distinctive 
proboscis. 





Figure 3.6a: A 
ciliated comb jelly, 
from the phylum 
Ctenophora. 


Figure 3.6e: 
The enigmatic 
jelly fish-like 
Eldonia. 


Figure 3.6f: 

A conical 
shelled animal 
from the 
phylum 
Hyolitha. 





Figure 3.6b: A filter- 
feeding worm-like 
member of the 
phylum Phoronida. 


FIGURE 3.6 

Cambrian explosion fossils from Chengjiang fauna, artist depictions and fossil 
photos. Photos in 3.6a, b, e, and f courtesy J. Y. Chen. Photos in 3.6c and d 
courtesy Paul Chien. 








As J. Y. Chen began to examine the sedimentary rocks that enclosed his 
fossilized sponge, he decided to look at them in a so-called thin section under a 
light microscope. Chen wondered whether smaller embryonic forms of these 
Precambrian animals might also have been preserved in these phosphorite rocks. 
Sure enough, under magnification he found little round balls that he and Paul 
Chien identified as sponge embryos. In 1999, at a major international conference 
about the Cambrian explosion held near Chengjiang, J. Y. Chen, Paul Chien, and 
three other colleagues presented their findings.*” 

A number of Chinese paleontologists questioned them at first, suggesting that 
the little round balls were not sponge embryos at all, but instead the remains of 


brown and green algae.*® Here Paul Chien’s expertise came to the fore. Early in 
his career, Chien had perfected a technique for examining the embryos of living 
sponges under a scanning electron microscope. He now adapted his technique to 
examine these microscopic fossilized structures using a more powerful 
microscope. What he found startled him and amazed other scientists. 

Sponges are nature’s glassworks. They are made of a soft and flexible lattice 
of cells from which protrude silica-encrusted “spicules.” Though sponges come 
in a variety of shapes and sizes, they are one of the simplest known forms of 
animal life, with between six and ten distinctive cell types.*? In comparison, the 
typical arthropod has between thirty-five and ninety cell types. 

As Chien examined the little balls of the Doushantuo Phosphorite under the 
scanning microscope, he noticed what looked like cells undergoing cell division. 
At first, he had no way to determine what type of cells they might be. But as he 
examined cross sections of these cells more carefully, he identified a distinctive 
structure that he knew from his prior research. 

Only sponges have spicules, and the fossilized cells he was examining 
preserved microscopic spicules in the early stages of their development.?? 
Clearly, these were not algal balls; they were sponge embryos. Even more 
surprising, Chien was able to observe the internal structure of these embryonic 
cells, allowing him to identify the nuclei of some of these cells within the 
fossilized remains of the larger outer cell membrane (see Fig. 3.7). 

The discovery of these sponge embryos has proven decisive in the case 
against the remaining versions of the artifact hypothesis, for several reasons. 

First, though spicules in sponges are encased in a thin layer of glassy silica, 
sponges are generally considered to be a soft-bodied organism because of the 
predominantly soft tissues out of which the rest of their bodies are made. 
Moreover, the cells of all embryos during their earliest embryonic stages are soft. 
Even in animals that have internal or external skeletons, the nascent forms of 
these hard parts do not emerge until gastrulation, partway through the process of 
embryological development. Thus, discovery of an embryo in the earliest stages 
of cell division shows beyond a doubt that Precambrian sedimentary rocks can, 
under the right circumstances, preserve soft-bodied organisms. 

It also established something else. J. Y. Chen found these sponge embryos 
beneath the Cambrian—Precambrian boundary in late Precambrian rock. Yet 
these Precambrian layers did not preserve remains of any clearly ancestral or 
intermediate forms leading to the other main groups of Cambrian animals. This 
raised an obvious question. If the Precambrian sedimentary strata beneath the 


Maotianshan Shale preserved the soft tissues of tiny, microscopic sponge 
embryos, why didn’t they also preserve the near ancestors of the whole animals 
that arose in the Cambrian, especially since some of those animals must have 
had at least some hard parts as a condition of their viability? If these strata could 
preserve embryos, then they should have preserved fully developed animals—at 
least, if such animals were present at the time. That well-developed, clearly 
ancestral animal forms were not preserved, when tiny sponge embryos”! were, 
strongly indicates that such forms were simply not present in the Precambrian 
layers. 
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FIGURE 3.7 

Figure 3.7a (above): Photographs of fossilized sponge embryos in the early 
stages of cell division, showing the sponge at the 8-cell stage of division, with 
four of its cells marked in the foreground. Figure 3.7b (right): A close-up image 
of a fossil of a sponge embryo cell revealing numerous yolk granules inside. 
Courtesy Paul Chien. 


Of course, there are conditions under which fossils are unlikely to be 
preserved. We know, for example, that near-shore sands do not favor 
preservation of detail, let alone the fine detail of very small organisms a 
millimeter or less in length.°* Even so, such considerations do little to bolster the 
artifact hypothesis. The sedimentary environments that produced the carbonates, 


phosphorites, and shales of the Precambrian strata beneath the Maotianshan 
Shale, for example, would have provided a congenial environment for fossilizing 
all sorts of creatures during Precambrian times. 

Advocates of the artifact hypothesis need to show not just that certain factors 
discourage preservation in general. No one disputes that. What they need to 
show is that these factors were ubiquitous in Precambrian depositional 
environments worldwide. If near-shore sands characterized all Precambrian 
sedimentary deposits, then paleontologists would not expect to find any fossils 
there, at least not any tiny ones. Yet clearly this is not the case. Precambrian 
strata include many types of sediments that can preserve—and in the case of the 
Doushantuo formation in China, have preserved—animal remains in fine detail, 
including small and vulnerable sponge embryos. 

Moreover, geologists Mark and Dianna McMenamin have noted that in many 
other Cambrian locales around the world, including one in Newfoundland that 
they have studied extensively, the pattern of sedimentation changes very little 
across the Cambrian—late Precambrian boundary, suggesting that many 
Precambrian environments would have provided equally good environments for 
the preservation of fossils.>° 

In their 2013 book, The Cambrian Explosion, paleontologists James Valentine 
and Douglas Erwin go further. They note that many late Precambrian 
depositional environments actually provide more favorable settings for the 
preservation of fossils than those present in the Cambrian period. As they note, 
“a revolutionary change in the sedimentary environment—from microbially 
stabilized sediments during the Ediacaran [late Precambrian] to biologically 
churned sediments as larger, more active animals appeared—occurred during the 
early Cambrian. Thus, the quality of fossil preservation in some settings may 
have actually declined from the Ediacaran to the Cambrian, the opposite of what 
has sometimes been claimed, yet we find a rich and widespread explosion of 
[Cambrian] fauna.” 


STATISTICAL PALEONTOLOGY RECENT WORK IN A FIELD KNOWN 
AS STATISTICAL PALEONTOLOGY CASTS FURTHER DOUBT ON THE 
ARTIFACT HYPOTHESIS. SINCE THE DISCOVERY OF THE BURGESS 
SHALE, PRECAMBRIAN-AND CAMBRIAN-ERA DISCOVERIES HAVE 

REPEATEDLY UNCOVERED FOSSIL FORMS THAT EITHER ESTABLISH 
RADICALLY DISPARATE NEW FORMS OF LIFE OR, INCREASINGLY, 
FORMS THAT FALL INTO EXISTING HIGHER TAXONOMIC GROUPS 


(SUCH AS CLASS, SUBPHYLUM, OR PHYLUM). 


As a result, the fossil record amply documents organisms corresponding to 
the terminal branches on the Darwinian tree of life (animal forms representing 
new phyla or classes, for example), but it fails to preserve those organisms 
representing the internal branches or nodes leading to these representatives of 
novel phyla and classes of Cambrian-era animals. Yet these intermediates are the 
very forms required to connect the terminal branches to form a coherent 
evolutionary tree and establish that the representatives of the Cambrian animals 
did arise by means of a gradual evolutionary process from simpler Precambrian 
ancestors. 

Recall that Louis Agassiz thought that this pattern could not be explained by 
appealing to an incomplete fossil record, because the fossil record was strangely 
selective in its incompleteness, preserving abundant evidence of the terminal 
branches but consistently neglecting to preserve the representatives of the 
internal branches or nodes. 

Contemporary paleontologists, such as Michael Foote at the University of 
Chicago, have come to a similar conclusion. Foote has shown, using statistical 
sampling analysis, that as more and more fossil discoveries fall within existing 
higher taxonomic groups (e.g., phyla, subphyla, and classes), and as they fail to 
document the rainbow of intermediate forms expected in the Darwinian view of 
the history of life, it grows ever more improbable that the absence of 
intermediate forms reflects a sampling bias—that is, an “artifact” of either 
incomplete sampling or preservation. 

This kind of analysis merely quantifies what, in other circumstances, we 
would sense intuitively. Imagine that you reach into an enormous barrel full of 
marbles and randomly pull out a yellow, a red, and a blue marble. At this point 
your brief sampling should leave you undecided as to whether you have a 
representative sample of the barrel’s contents. You might at first imagine that the 
barrel also contains marbles representing a rainbow of intermediate colors. But 
as you continue to sample from every place in the barrel and find that the barrel 
disgorges only those same three colors you begin to suspect that it may offer a 
much more limited selection of colors than, say, the rack of color samples at 
your local paint store. 

Over the past 150 years or so, paleontologists have found many 
representatives of the phyla that were well known in Darwin’s time (by analogy, 
the equivalent of the three primary colors) and a few completely new forms 


altogether (by analogy, some other distinct colors such as green and orange, 
perhaps). And, of course, within these phyla there is a great deal of variety. 
Nevertheless, the analogy holds at least insofar as the differences in form 
between any member of one phylum and any member of another phylum are 
vast, and paleontologists have utterly failed to find forms that would fill these 
yawning chasms in what biologists call “morphological space.” In other words, 
they have failed to find the paleontological equivalent of the numerous finely 
graded intermediate colors (Pendleton blue, dusty rose, gun barrel gray, magenta, 
etc.) that interior designers covet. Instead, extensive sampling of the fossil record 
has confirmed a strikingly discontinuous pattern in which representatives of the 
major phyla stand in stark isolation from members of other phyla, without 
intermediate forms filling the intervening morphological space. 

Foote’s statistical analysis of this pattern, documented by an ever increasing 
number of paleontological investigations, demonstrates just how improbable it is 
that there ever existed a myriad of as yet undiscovered intermediate forms of 
animal life—forms that could close the morphological distance between the 
Cambrian phyla one tiny evolutionary step at a time. In effect, Foote’s analysis 
suggests that since paleontologists have reached repeatedly into the proverbial 
barrel, sampled it from one end to the other, and found only representatives of 
various radically distinct phyla but no rainbow of intermediates, we shouldn’t 
hold our breath expecting such intermediates to eventually emerge. He asks 
“whether we have a representative sample of morphological diversity and 
therefore can rely on patterns documented in the fossil record.” The answer, he 
says, is yes. 

By this affirmation, he doesn’t mean that there are no biological forms left to 
discover. He means, rather, that we have good reason to conclude that such 
discoveries will not alter the largely discontinuous pattern that has emerged. 
“Although we have much to learn about the evolution of form,” he writes, the 
Statistical pattern created by our existing fossil data demonstrates that “in many 
respects our view of the history of biological diversity is mature.”°° 


CHENGJIANG AND THE CAMBRIAN CONUNDRUM”’ 


The Cambrian and Precambrian fossils from southern China have rendered the 
mystery associated with the Cambrian explosion more acute in other ways as 
well. First, the fossil finds in southern China coupled with advances in 
radiometric dating techniques applied to other Cambrian-era strata have allowed 


scientists to reassess the duration of the Cambrian explosion. As the name 
implies, the fossils documenting the Cambrian explosion appear within a 
relatively narrow slice of geologic time. Until the early 1990s, most 
paleontologists thought the Cambrian period began 570 million and ended 510 
million years ago, with the Cambrian explosion of novel animal forms occurring 
within a 20-to 40-million-year window during the lower Cambrian period. 

Two developments have led paleontologists and geochronologists to revise 
those estimates downward. First, in 1993, radiometric dating of zircon crystals 
from formations just above and below Cambrian strata in Siberia allowed for a 
precise redating of Cambrian strata. Radiometric analyses of these crystals fixed 
the start of the Cambrian period at 544 million years ago,°® and the beginning of 
the Cambrian explosion itself to about 530 million years ago (see Fig. 3.8). 
These studies also suggested that the explosion of the novel Cambrian animal 
forms occurred within a window of geologic time much shorter than previously 
believed, lasting no more than 10 million years, and the main “period of 
exponential increase of diversification” lasting only 5 to 6 million years.°° 
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FIGURE 3.8 
The Cambrian explosion occurred within a narrow window of geological time. 





Geologically speaking, 5 million years represents a mere 1/10 of 1 percent 
(0.11 percent, to be precise) of earth’s history. J. Y. Chen explains that 
“compared with the 3-plus-billion-year history of life on earth, the period [of the 
explosion] can be likened to one minute in 24 hours of one day.” 


Some geologists or evolutionary biologists dispute these numbers, but they do 
so by redefining the Cambrian explosion as a series of separate events rather 
than using the term to refer to the main radiation of new body plans in the lower 
Cambrian. In 2009, I participated in a debate in which one of my opponents, 
paleontologist Donald Prothero, from Occidental College, used this common 
rhetorical strategy to minimize the severity of the Cambrian mystery. In his 
opening statement, he claimed that the Cambrian explosion actually took place 
over an 80-million-year period of time and that consequently those who cited the 
Cambrian as a challenge to the adequacy of the neo-Darwinian theory were 
mistaken. As I was listening to his opening statement, I consulted his textbook to 
see how he had derived his 80-million-year figure. Sure enough, he had included 
in the Cambrian explosion three separate pulses of new innovation or 
diversification, including the origin of a group of late Precambrian organisms 
called the Ediacaran or Vendian fauna. He also included not only the origin of 
the animal body plans in the lower Cambrian, but also the subsequent minor 
diversification (variations on those basic architectural themes) that occurred in 
the upper Cambrian. He included, for example, not just the appearance of the 
first trilobites, which occurred suddenly in the lower Cambrian, but also the 
origin of a variety of different trilobite species later from the upper Cambrian. 

In my response to Prothero, I noted that he was, of course, free to redefine the 
term “Cambrian explosion” any way he liked, but that by using the term to 
describe several separate explosions (of different kinds), he had done nothing to 
diminish the difficulty of explaining the origin of the first explosive appearance 
of the Cambrian animals with their unique body plans and complex anatomical 
features. Beyond this, as we’ll see in the next chapter, the Vendian organisms 
may not have been animals at all, and they bear little resemblance to any of the 
animals that arise in the Cambrian. We’ll also see that most, if not all,®! of these 
organisms actually went extinct well before the origin of the animals that first 
appear in the lower Cambrian and so they do little to minimize the problem of 
the explosive origin of animals. 

In any case, expanding the definition of the Cambrian explosion only 
obscures the real challenge posed by the event, a challenge underscored by the 
discoveries at Chengjiang. An analysis by MIT geochronologist Samuel 
Bowring has shown that the main pulse of Cambrian morphological innovation 
occurred in a sedimentary sequence spanning no more than 6 million years.°* Yet 
during this time representatives of at least sixteen completely novel phyla and 
about thirty classes first appeared in the rock record. In a more recent paper 


using a slightly different dating scheme, Douglas Erwin and colleagues similarly 
show that thirteen new phyla appear in a roughly 6-million-year window.®* As 
we’ve seen, among these animal forms were the first trilobites, with their lens- 
focusing compound eyes among other complex anatomical features. The 
problem of explaining how so many new forms and structures arose so rapidly in 
the first explosive period of the Cambrian remains, whether or not one decides to 
include within the designation “Cambrian explosion” other distinct events (see 
Part Two). 


THE TOP-DOWN PATTERN IN STARK RELIEF THE CHENGJIANG 
FAUNA MAKES THE CAMBRIAN EXPLOSION MORE DIFFICULT TO 
RECONCILE WITH THE DARWINIAN VIEW FOR YET ANOTHER 
REASON. THE CHENGJIANG DISCOVERIES INTENSIFY THE TOP- 
DOWN PATTERN OF APPEARANCE IN WHICH INDIVIDUAL 
REPRESENTATIVES OF THE HIGHER TAXONOMIC CATEGORIES 
(PHYLA, SUBPHYLA, AND CLASSES) APPEAR AND ONLY LATER 
DIVERSIFY INTO THE LOWER TAXONOMIC CATEGORIES (FAMILIES, 
GENERA, AND SPECIES). 


Discoveries at Chengjiang contradict the bottom-up pattern that neo- 
Darwinism expects. The site does not show the gradual emergence of unique 
species followed slowly but surely by the emergence of representatives of ever 
higher and more disparate taxa, leading to novel phyla. Instead, like the Burgess 
Shale, it shows body plan—level disparity arising first and suddenly, with no 
evidence of a gradual unfolding and ranging through the lower taxonomic 
groups. 

Consider the case of the early chordates, a phylum consisting of creatures 
possessing a flexible, rod-shaped structure called a notochord. Mammals, fish, 
and birds are familiar members of this phylum. Prior to the discovery of the 
Chengjiang biota, chordates were unknown in the Cambrian period and were 
thought to have first appeared only much later, during the Ordovician period.™ 
Now, following the discoveries at Chengjiang, the first appearance of chordates 
in the Cambrian period has been amply documented. 

For example, J. Y. Chen and several other Chinese paleontologists have found 
a spindle-shaped eel-like animal called Yunnanozoon lividum, which many 
paleontologists have interpreted as an early chordate because it possesses, 
among other features, a digestive tract, branchial arches, and a large notochord 


much like a spinal cord.® In addition, J. Y. Chen and colleagues have reported 
the discovery of a sophisticated craniate-like chordate called Haikouella 
lanceolata from the lower Cambrian Maotianshan Shale. According to Chen and 
others, Haikouella has many of the same features of Yunnanozoon lividum as 
well as several additional anatomical features including a “heart, ventral and 
dorsal aorta, an anterior branchial arterial, gill filaments, a caudal (posterior) 
projection, a neural cord with a relatively large brain, a head with possible lateral 
eyes, and a ventrally situated buccal cavity with short tentacles.”°° 








FIGURE 3.9 
Figure 3.9a (top): Drawing of the Cambrian fish, Myllokunmingia fengjiaoa. 
Figure 3.9b (bottom): Photograph of Myllokunmingia fengjiaoa fossil. Reprinted 
by permission from Macmillan Publishers Ltd.: Nature, Shu et al., “Lower 
Cambrian Vertebrates from South China,” Nature, 402 (November 4, 1999): 42— 


46. Copyright 1999. 


Simon Conway Morris, with D. G. Shu and several Chinese colleagues, has 
reported an even more dramatic find. They have discovered the fossilized 
remains of two small Cambrian fish, Myllokunmingia fengjiaoa and 
Haikouichthys ercaicunensis (see Fig. 3.9), suggesting a much earlier appearance 
for both fishes and vertebrates (a class of chordates), both of which were first 
thought to have originated in the Ordovician period, about 475 million years ago. 
Both of these taxa are jawless fish (agnathans) and are considered by Shu and 
others to be closely allied to modern lampreys.°’ Finally, a paper by Shu and 
others reports the first convincing specimen of another type of chordate from the 
Cambrian, a urochordate (tunicate).°° This specimen, Cheungkongella 
ancestralis, is likewise found in the early Cambrian shales (Qiongzhusi 
Formation) near Chengjiang. These recent finds demonstrate that not only did 
the phylum Chordata first appear in the Cambrian, but also each one of the 
chordate subphyla (Cephalochordata, Craniata, and Urochordata) first emerged 
then as well. In any case, the discovery in China of chordates, and other 
previously undiscovered phyla in the Cambrian, only accentuates the puzzling 
top-down pattern of appearance that other Cambrian discoveries had previously 
established.°? 


MORE QUESTIONS THAN ANSWERS THUS, DESPITE THE EFFORTS 
TO EXPLAIN THE CAMBRIAN EXPLOSION USING VARIOUS VERSIONS 
OF THE ARTIFACT HYPOTHESES, THE MYSTERY OF THE CAMBRIAN 
EXPLOSION HAS ONLY GROWN MORE ACUTE AS A RESULT OF THE 
DRAMATIC DISCOVERIES IN SOUTHERN CHINA THAT HAVE TURNED 
DARWIN’S TREE OF LIFE UPSIDE DOWN. 


When I first heard J. Y. Chen describe these discoveries in 2000, I had been 
investigating another unsolved question about the history of life: What caused 
the first living cell, and the information it contains, to arise? As I heard Dr. Chen 
speak that day in Seattle, my interest in another puzzling question about the 
history of life began to germinate. Could it be that the origin of animal life was 
in its own way just as difficult a problem as that of the origin of life itself? 
Though I eventually concluded that the Cambrian explosion does, indeed, 
present a profound challenge to contemporary Darwinian theory, it didn’t take 
me long to discover that some scientists believed that the mystery of the 


Cambrian explosion had already been resolved by the discovery of some rather 
enigmatic Precambrian fossils. We turn to those next. 


4 
The Not Missing Fossils? 


The atmosphere in the auditorium of the Sam Noble Science Museum at the 
University of Oklahoma was uncomfortably tense, with a security detail of local 
Norman, Oklahoma, police officers on hand to keep the peace—a conspicuous 
change from the campus security guard who might be present at a typical 
university event. The occasion? A new documentary, Darwin’s Dilemma, that 
Jonathan Wells, a colleague of mine from the Discovery Institute, and I were 
scheduled to show. The film would explore the challenge to Darwin’s theory 
posed by the Cambrian fossil record. 

For weeks before our event in September of 2009, outspoken evolutionary 
biology students and an atheist student group, both egged on by militant off- 
campus bloggers, had threatened to disrupt the screening. Members of the 
biology faculty pledged to come, so that well before the official start time a large 
crowd had gathered. 

The museum and the geology department, not wanting to complicate matters 
in their own minds by watching the film first, decided to launch a preemptive 
first strike by issuing a disclaimer and scheduling an official lecture designed to 
rebut the film. In the disclaimer the museum stated that, given its public funding, 
it had no choice but to rent the auditorium to groups irrespective of their 
“religious beliefs” or “scientific literacy.” The disclaimer further noted that the 
Sam Noble Science Museum did “not support unscientific views masquerading 
as science, such as those of the Discovery Institute.” The museum flyer also 
announced the lecture of one of their curators, a paleontologist from the 
university, and mocked the topic of the film—the Cambrian “explosion”—with 
carefully placed scare quotes. The lecture’s start time of 5:00 pM. also ensured a 
confluence afterward of the audience made upset by the lecture and the more 
friendly audience coming to see the film. 

Jonathan Wells, a biologist well known for his skepticism about 
contemporary Darwinian theory, attended the pre-film lecture. Ignoring a few 


hostile glares, he listened as the paleontologist from the university argued that 
the Cambrian explosion presented no actual dilemma for Darwinian evolution, 
and as this same paleontologist speculated that if Darwin had only known what 
paleontologists today know of the Cambrian fossil record, he (Darwin) would 
have celebrated it as confirmation of his theory. This particular paleontologist 
also denied that novel animal forms emerged suddenly in the Cambrian. Instead, 
he argued that they arose in rudimentary form much earlier in the late 
Precambrian. He noted that paleontologists had discovered in late Precambrian 
sediments fossilized sponges, a type of primitive mollusk, and the burrows of 
worms. 

He also laid particular stress on the significance of a group of enigmatic 
organisms first discovered in the Ediacaran Hills in southern Australia dating 
from about 565 million years ago, in a Precambrian period known as the Vendian 
or Ediacaran. Jonathan and I were well aware that most paleontologists do not 
regard these fossilized organisms as plausible ancestors to the Cambrian fauna. 
But on that evening the expert from the university claimed the opposite. He also 
claimed that some obscure Ediacaran organisms (with exotic names such as 
Vernanimalcula, Parvancorina, and Arkarua) represented early bilaterians 
(bilaterally symmetrical animals), arthropods, and echinoderms. He insisted that 
these organisms pushed back the explosion of animal life by some 40 million 
years, establishing a “fuse” for the Cambrian explosion in the form of primitive 
and presumably ancestral animal forms for several of the most significant 
Cambrian phyla and body designs. 

An hour before I was to walk over to lead a discussion and answer questions 
about the Cambrian film from what proved to be an intensely hostile audience, 
Jonathan Wells called me with a report on the museum’s attempt to refute the 
film preemptively. The presentation claimed to have resolved the Cambrian 
mystery—Darwin’s dilemma—by showing that the ancestral precursors to the 
major groups of Cambrian animals had been found after all. But is this true? 


THE EDIACARAN FAUNA AND VENDIAN RADIATION IN THE 
PREVIOUS CHAPTER, WE SAW THAT MANY PROMINENT 
PALEONTOLOGISTS HAVE SOUGHT TO EXPLAIN THE CAMBRIAN 
EXPLOSION AS AN ARTIFACT OF OUR INCOMPLETE SAMPLING OF 
AN INCOMPLETE FOSSIL RECORD. THE LECTURE THAT MY 
COLLEAGUE HEARD THAT NIGHT IN OKLAHOMA TOOK A VERY 
DIFFERENT APPROACH, GIVING THE STRONG IMPRESSION THAT 


THE PRECAMBRIAN FOSSIL RECORD ACTUALLY DOES PRESERVE 
THE ANCESTRAL FORMS OF THE CAMBRIAN ANIMALS AND THAT 
THE EDIACARAN FAUNA, IN PARTICULAR, PROVIDE SEVERAL 
STRIKING EXAMPLES OF SUCH FORMS. 


In public presentations about the Cambrian explosion, I’ve often encountered 
this claim, though usually in the form of an unfocused question: “What about the 
Ediacaran?” Nevertheless, in writing about the Cambrian, I take care not to 
attribute the idea that the Ediacaran fauna represent Cambrian ancestors to 
leading Ediacaran or Cambrian experts, lest I critique a straw man. Most 
paleontologists doubt that well-known Ediacaran forms represent ancestors of 
the Cambrian animals and few think the late Precambrian fossil record as a 
whole makes the Cambrian explosion appreciably less explosive. The claim is 
important to address, however, since it persists as a kind of paleontological urban 
legend, one that even occasionally finds its way into the mouths of 
paleontologists. 

The Ediacaran fauna derive their name from their most notable discovery site, 
the Ediacaran Hills in the outback of southeastern Australia. These fauna date 
from late Precambrian time, a period that the International Union of Geological 
Sciences has recently renamed the “Ediacaran period.”! Since geologists used to 
call the last period of Precambrian time the “Vendian period,” paleontologists 
also refer to the Ediacaran fauna as the Vendian fauna or biota (see Fig. 1.6). 
Paleontologists have made additional discoveries of Edicaran-or Vendian-era 
creatures in England, Newfoundland, the White Sea in northwestern Russia, and 
the Namibian desert in southern Africa, suggesting a near worldwide 
distribution. Although these fossils were originally dated to between 700 and 
640 million years old, volcanic ash beds both below and above the Namibian site 
have recently provided more accurate radiometric dates. These studies fix the 
date for the first appearance of the Ediacaran fauna at about 570—565 million 
years ago, and the last appearance at the Cambrian boundary about 543 million 
years ago, or about 13 million years before the onset of the Cambrian explosion 
itself. 

The late Precambrian-era sediments around the world have yielded four main 
types of fossils, all of which are dated between about 570 and 543 million years 
ago. The first group consists of the Precambrian sponges mentioned in the 
previous chapter. These animals first arose about 570 to 565 million years ago. 
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Figure 4.1a: Artist depiction of Dickinsonia (left) and photograph of Dickinsonia 
fossil (right). 
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Figure 4.1b: Artist depiction of Figure 4.1c: Artist depiction of Charnia 
Spriggina (left) and photograph of (left), and photograph of Charnia fossil 
Spriggina fossil (right). (right). Courtesy Wikimedia Commons, 
user Smith609. 
FIGURE 4.1 


Examples of enigmatic Ediacaran fossils: Dickinsonia, Spriggina, and Charnia. 
Fossil photos in Figures 4.1a and 4.1b courtesy Peterson, K. J., Cotton, J. A., 
Gehling, J. G., and Pisani, D., “The Ediacaran Emergence of Bilaterians: 
Congruence Between the Genetic and the Geological Fossil Records,” 
Philosophical Transactions of the Royal Society B, 2008, 363 (1496): 1435-43, 
Figure 2, by permission of the Royal Society. 


The second is the distinctive group of fossils from the Ediacaran Hills. The 
creatures fossilized there include such well-known forms as the flat, air mattress- 
like body of Dickinsonia; the enigmatic Spriggina, with its elongated and 
segmented body and possible head shield; and the frond-like Charnia (see Fig. 
4.1). These organisms were at least mostly soft-bodied and are large enough to 
identify with the naked eye. 

The third group includes what are called trace fossils, the possible remains of 


animal activity such as tracks, burrows, and fecal pellets. Some paleontologists 
have attributed these trace fossils to ancient worms. 

The fourth group is the fossils of what may be primitive mollusks, a 
possibility that received support from a recent discovery in the cliffs along the 
White Sea in northwestern Russia. There, Russian scientists have discovered 
thirty-five distinctive specimens of a possible mollusk called Kimberella, 
probably a simple animal form. These new White Sea specimens, dated to 550 
million years ago, suggest that Kimberella “had a strong [though not hard], 
limpet-like shell, crept along the sea floor, and resembled a mollusk.”* 
Paleontologist Douglas Erwin, of the Smithsonian Institution, has commented: 
“Tt’s the first animal that you can convincingly demonstrate is more complicated 
than a flatworm.”* Additionally, seafloor tracks from Precambrian sediments in 
both Canada and Australia have been attributed to mollusks, since the tracks 
resemble what might have been left by a row of small teeth on the tongue-like 
ribbon of some mollusks as they scraped food particles off the seafloor. In this 
case, Kimberella may well have been the track maker.? The authors of the 
original descriptive paper in Nature, Mikhail Fedonkin, from the Russian 
Academy of Sciences, and Benjamin Waggoner, then at the University of 
California at Berkeley, conclude as much and suggest that such creatures “began 
to diversify before the beginning of the Cambrian.”° Paleontologists, however, 
are still weighing the evidence.’ 


THE SIGNIFICANCE OF THE EDIACARAN SO DO EITHER THE 
REMAINS OF THE SPECIFIC ORGANISMS FROM THE EDIACARAN 
HILLS OR THE EDIACARAN OR VENDIAN BIOTA AS A WHOLE SOLVE 
THE PROBLEM OF THE CAMBRIAN EXPLOSION? DO THESE EXOTIC 
FORMS REPRESENT A KIND OF FUSE TO THE CAMBRIAN EXPLOSION 
THAT ELIMINATES THE NEED TO EXPLAIN THE RAPID EMERGENCE 
OF NOVEL BODY PLANS AND FORMS OF ANIMAL LIFE? THERE ARE 
MANY GOOD REASONS TO DOUBT THIS IDEA. 


First, with the exception of sponges and the possible exception of Kimberella, 
the body plans of visibly fossilized organisms (as opposed to trace fossils) bear 
no clear relationship to any of the organisms that appear in the Cambrian 
explosion (or thereafter). The most noted Ediacaran organisms such as 
Dickinsonia, Spriggina, and Charnia do not have an obvious head, a mouth, 
bilateral symmetry (see below), a gut, or sense organs such as eyes. Some 


paleontologists question whether these organisms even belong in the animal 
kingdom. 

Dickinsonia, for example, has been interpreted by University of Oregon 
paleontologist Gregory Retallack as having “fungal-lichen” affinities, since its 
mode of fossil preservation “is comparable not with that of soft-bodied jellyfish, 
worms, and cnidarians, but with the fossil record of fungi and lichens.” 
Dickinsonia’s taxonomic position, Retallack notes, has long been an unsolved 
puzzle. “Biological affinities of Dickinsonia remain problematic,” he writes, 
since it has been “variously considered a polychaete, turbellarian or annelid 
worm, jellyfish, polyp, xenophyophoran protist, lichen or mushroom.”? 

Similar disputes have characterized attempts to classify Spriggina. In 1976, 
Martin Glaessner, the first paleontologist to study the Ediacaran in detail, 
described Spriggina as a possible annelid polychaete worm based largely upon 
its segmented body. Nevertheless, Simon Conway Morris later rejected that 
hypothesis because Spriggina shows no evidence of the distinguishing “chaetes,” 
leg-like bristled protrusions that polychaete worms possess. Glaessner himself 
later repudiated his original hypothesis that Spriggina was ancestral to 
polychaetes, noting that Spriggina “cannot be considered as a_ primitive 
polychaete, having none of the possible ancestral characters indicated . . . by 
specialists on the systematics and evolution of this group.” !° 

In 1981, paleontologist Sven Jorgen Birket-Smith produced a reconstruction 
of a Spriggina fossil showing that it possessed a head and legs similar to those of 
trilobites, though examinations of subsequent Spriggina specimens have shown 
no evidence of it possessing limbs of any kind.'' In 1984, Glaessner weighed in 
on this discussion as well. He argued that “Spriggina shows no specific 
characters of the arthropods, particularly of the trilobites.”!* He also noted that 
the body segmentation of Spriggina, and “its known appendages are at the level 
of polychaete annelids”!? (although, as noted, by this time he had rejected 
Spriggina as a possible polychaete ancestor). Instead, he proposed that Spriggina 
represented a side branch on the animal tree of life—one that resulted, 
“metaphorically” perhaps, in “an unsuccessful attempt to make an arthropod.” 

In a presentation to the Geological Society of America in 2003, geologist 
Mark McMenamin revived the idea that Spriggina might represent a trilobite 
ancestor. He argued that several features present in Spriggina fossils are 
comparable to those in trilobites such as “the presence of genal spines” and an 
effaced head or “cephalic region.”'4 Nevertheless, many Ediacaran experts, 
including McMenamin, have also noted that Spriggina specimens show no 


evidence of eyes, limbs, mouths, or anuses, most of which are known from fossil 
trilobites.” Other paleontologists remain skeptical about whether Spriggina does 
in fact exhibit genal spines, noting that good specimens seem to show relatively 
smooth edges with no protruding spines.'° In addition, analysis of the best 
specimens of Spriggina shows that it does not exhibit bilateral symmetry, 
undermining earlier attempts to classify it as a bilaterian animal, and by 
implication an arthropod.!’ Instead, Spriggina exhibits something called “glide 
symmetry” in which the body segments on either side of its midline are offset 
rather than aligned.'® As geologist Loren Babcock of Ohio State University 
notes, “The zipper-like body plans of some Ediacaran (Proterozoic) animals such 
as Dickinsonia and Spriggina involve right and left halves that are not perfect 
mirror images of each other.”!’ The lack of such symmetry, a distinctive feature 
of all bilaterian animals, and the absence in Spriggina specimens of many other 
distinguishing features of trilobites, has left the classification of this enigmatic 
organism uncertain. 

Paleontologists James Valentine, Douglas Erwin, and David Jablonski distill 
the confusing welter of conflicting views about the Ediacaran fossils: “Although 
the soft-bodied fossils that appear about 565 million years ago are animal-like, 
their classifications are hotly debated. In just the past few years these fossils 
have been viewed as protozoans; as lichens; as close relatives of the cnidarians; 
as a sister group to cnidarians plus all other animals; as representatives of more 
advanced, extinct phyla; and as representatives of a new kingdom entirely 
separate from the animals.”2° What’s more, Valentine, Erwin, and Jablonski note 
that those paleontologists who do regard the Ediacaran fauna as animals rarely 
classify them the same way, underscoring their lack of clear affinities to any 
known animal groups. As they note, “Still other specialists have parceled the 
fauna out among living phyla, with some assigned to the Cnidaria and others to 
the flatworms, annelids, arthropods and echinoderms.”*! The uncertain standing 
of these fossilized forms is partly due to their early extinction, but it also stems 
from an absence of defining characteristics shared with known groups. They 
conclude: “This confusing state of affairs arose because these body fossils do not 
tend to share definitive anatomical details with modem groups, and thus the 
assignments must be based on vague similarities of overall shape and form, a 
method that has frequently proved misleading in other cases.”*7 

Other leading paleontologists also doubt that the Cambrian animals 
descended from these Ediacaran forms. In a phylogenetic diagram showing the 


evolutionary relationship of Precambrian and Cambrian fossils, Oxford 
biologists Alan Cooper and Richard Fortey depict the Ediacaran fauna as lying 
on a line of descent separate from the Cambrian animals rather than being 
ancestral to them. In another paper, Fortey asserts that the beginning of the 
Cambrian “saw the sudden appearance in the fossil record of almost all the main 
types of animals (phyla) that still dominate the biota today.” He concedes that 
there are a variety of fossils in older strata, but insist that “they are either very 
small (such as bacteria and algae) or their relationships to the living fauna are 
highly contentious, as is the case with the famous soft-bodied fossils from the 
late Precambrian Pound Quartzite, Ediacara, South Australia.”2* 

Similarly, paleontologist Andrew Knoll and biologist Sean B. Carroll have 
argued: “It is genuinely difficult to map the characters of Ediacaran fossils onto 
the body plans of living invertebrates.”*? Although many paleontologists 
initially showed interest in the possibility that the Cambrian animal forms might 
have evolved from the Ediacaran organisms, paleontologist Peter Ward explains 
that “later study cast doubt on the affinity between these ancient remains 
preserved in sandstones [the Australian Ediacaran] and living creatures of today” 
(that is, animals representing phyla that first arose in the Cambrian).*° As Nature 
recently noted, if the Ediacaran fauna “were animals, they bore little or no 
resemblance to any other creatures, either fossil or extant.”?7 

This absence of clear affinities has led an increasing number of 
paleontologists to reject ancestor-descendant relationships between all but (at 
most) a few of the Ediacaran and Cambrian fauna. Nevertheless, some have 
suggested that trace fossils may establish a link. In an authoritative 2011 paper in 
the journal Science, Douglas Erwin and colleagues described the discovery of 
Ediacaran trace fossils consisting of surface tracks, burrows, fecal pellets, and 
feeding trails, which, they argue, though small, could only have been made by 
animals such as worms with a relatively high degree of complexity.*® On the 
basis of these findings, Erwin and other paleontologists have argued that these 
trace fossils suggest the existence of organisms with a head and tail, nervous 
systems, a muscular body wall allowing creeping or burrowing, and a gut with 
mouth and anus.*? Other paleontologists suggest that these characteristics may 
indicate the presence of a Precambrian mollusk or a worm phylum.°? 

Graham Budd, a British paleontologist who works at Uppsala University in 
Sweden, and others, have disputed these associations. Budd and geologist 
colleague Sdéren Jensen argue that many alleged trace fossils actually show 


evidence of inorganic origin: “There are numerous reports of older trace fossils, 
but most can be immediately shown to represent either inorganic sedimentary 
structures or metaphytes [land plants], or alternatively they have been 
misdated.”*! Still others have suggested that surface tracks and trails could have 
been left by mobile single-celled organisms, including a known form of a giant 
deep-sea protist that leaves bilaterian-like impressions. As one paper explains, 
“Some such traces date back to 1.5 billion to 1.8 billion years ago, which 
outdates even the boldest claims of the time of origin of animal multi-cellularity 
and forces researchers to contemplate the possibility of an inorganic or bacterial 
origin.”°* 

Even the most favorable interpretations of these trace fossils suggest that they 
indicate the presence of no more than two animal body plans (of largely 
unknown characteristics). Thus, the Ediacaran record falls far short of 
establishing the existence of the wide variety of transitional intermediates that a 
Darwinian view of life’s history requires. The Cambrian explosion attests to the 
first appearance of organisms representing at least twenty phyla and many more 
subphyla and classes, each manifesting distinctive body plans. In a best case, the 
Ediacaran forms represent possible ancestors for, at most, four distinct Cambrian 
body plans, even counting those documented only by trace fossils. This leaves 
the vast majority of the Cambrian phyla with no apparent ancestors in the 
Precambrian rocks (i.e., at least nineteen of the twenty-three phyla present in the 
Cambrian have no representative in Precambrian strata).°° 

Third, even if representatives of four animal phyla were present in the 
Ediacaran period, it does not follow that these forms were necessarily 
transitional or intermediate to the Cambrian animals. The Precambrian sponges 
(phylum Porifera), for example, were quite similar to their Cambrian brethren, 
thus demonstrating, not a gradual transformation from a simpler precursor or the 
presence of an ancestor common to many forms, but quite possibly only an 
earlier first appearance of a known Cambrian form. The same may be true of 
whatever kind of worm may be attested by Precambrian tracks and burrows. 

Moreover, even assuming, as some evolutionary biologists do,** that later 
Cambrian animals had a sponge-like Precambrian ancestor, the gap in 
complexity as measured by the number of cell types alone, to say nothing of the 
specific anatomical structures and distinct modes of body plan organization that 
are present in later animals but not in sponges, leaves a massive discontinuity in 
the fossil record that requires explanation (much like the morphological gap 
between Spriggina and actual arthropods). 


AN EDIACARAN MINI-EXPLOSION THE EDIACARAN FOSSILS 
THEMSELVES PROVIDE EVIDENCE OF A PUZZLING LEAP IN 
BIOLOGICAL COMPLEXITY, THOUGH NOT ONE NEARLY GREAT 
ENOUGH (OR OF THE RIGHT KIND) TO ACCOUNT FOR THE 
CAMBRIAN EXPLOSION. BEFORE ORGANISMS LIKE KIMBERELLA, 
DICKINSONIA, AND SPONGES APPEARED, THE ONLY LIVING FORMS 
DOCUMENTED IN THE FOSSIL RECORD FOR OVER 3 BILLION YEARS 
WERE SINGLE-CELLED ORGANISMS AND COLONIAL ALGAE. 
PRODUCING SPONGES, WORMS, AND MOLLUSKS FROM SINGLE- 
CELLED ORGANISMS IS A LITTLE LIKE TRANSFORMING A SPINNING 
TOP INTO A BICYCLE. THE BICYCLE ISN’T REMOTELY AS COMPLEX 
AS THE AUTOMOBILE SITTING BESIDE IT, BUT IT REPRESENTS AN 
ENORMOUS LEAP IN TECHNOLOGICAL SOPHISTICATION OVER THE 
SPINNING TOP. LIKEWISE, ALTHOUGH THE HUMBLE EDIACARAN 
BIOTA LOOK SIMPLE BESIDE MOST OF THE CAMBRIAN ANIMALS, 
THEY REPRESENT AN ENORMOUS LEAP IN FUNCTIONAL 
COMPLEXITY OVER THE SINGLE-CELLED ORGANISMS AND 
COLONIAL ALGAE THAT PRECEDED THEM. 


Thus, the Ediacaran biota attest to a separate sudden increase in biological 
complexity within a short window of geological time (about 15 million years), 
following roughly 3 billion years in which only single-celled organisms 
inhabited the earth.2° This leap in complexity, in a relatively short span of 
geological time, may well exceed the explanatory resources of natural selection 
working on random mutations. We will return to that question in Part Two. 

The Ediacaran fossils therefore do not solve the problem of the sudden 
increase in biological form and complexity during the Cambrian. Instead, they 
represent an earlier, if less dramatic, manifestation of the same kind of problem. 
To biology’s “big bang,”°° the Ediacaran biota add a significant “pow.” As 
paleobiologist Kevin Peterson, of Dartmouth College, and his colleagues note, 
these fauna represent “an apparent quantum leap in ecological complexity as 
compared with the ‘boring billions’ [of years] that characterize Earth before the 
Ediacaran,” even if these organisms are “still relatively simple when compared 
with the Cambrian,” which they characterize as another “quantum leap in 
organismal and ecological complexity.”°’ 

Many paleontologists now refer to the Ediacaran radiation as an explosion in 
its own right.°° This Precambrian “pow” makes the problem of fossil 


discontinuity only more acute, since credible intermediates leading to the 
Ediacaran layers are completely nonexistent in the even more sparsely populated 
Strata beneath them. 

Finally, even if one regards the appearance of the Ediacaran fossils as 
evidence of a “fuse” leading to the Cambrian explosion as some have 
proposed,”? the total time encompassed by the Ediacaran and Cambrian 
radiations still remains exceedingly brief relative to the expectations and 
requirements of a modern neo-Darwinian view of the history of life. As I will 
explain in more detail in Chapter 8, neo-Darwinism is the modern version of 
Darwin’s theory that invokes random genetic changes called mutations as the 
source of much of the new variation upon which natural selection acts. Like 
classical Darwinism, the neo-Darwinian mechanism requires great stretches of 
time to produce novel biological form and structure. Yet, current studies in 
geochronology suggest that only 40 to 50 million years elapsed between the 
beginning of the Ediacaran radiation (570—565 million years ago) and the end of 
the Cambrian explosion (525-520 million years ago).4° To anyone unfamiliar 
with the equations of population genetics by which neo-Darwinian evolutionary 
biologists estimate how much morphological change is likely to occur in a given 
period of time, 40 to 50 million years may seem like an eternity. But empirically 
derived estimates of the rate at which mutations accumulate imply that 40 to 50 
million years does not constitute anything like enough time to build the 
necessary anatomical novelties that arise in the Cambrian and Ediacaran periods. 
I will describe this problem in more detail in Chapter 12. 

Until recently, radiometric studies had estimated the duration of the Cambrian 
radiation itself at 40 million years, a period of time so brief, geologically 
speaking, that paleontologists had already dubbed it an “explosion.” The relative 
suddenness of the Cambrian explosion, even on the earlier measure of its 
duration, had already raised serious questions about the adequacy of the neo- 
Darwinian mechanism; consequently, it had also raised questions about whether 
a Darwinian understanding of the history of life could be reconciled with the 
Cambrian and Precambrian fossil record. Thus, treating the Ediacaran and the 
Cambrian radiations as one continuous evolutionary event (itself an 
unrealistically generous assumption) only returns the problem to its earlier (pre- 
zircon redating) status. 

For all these reasons, the late Precambrian fossils have not solved, but instead 
have deepened, the mystery of the origin of animal form. And few leading 
Cambrian paleontologists, of whom I was aware on that September evening in 


2009 while preparing to answer questions at the University of Oklahoma, 
thought otherwise. 


EDIACARAN EXOTICA SO WHAT ABOUT THE CLAIM THAT CERTAIN 
EXOTIC EDIACARAN FOSSILS ARE PLAUSIBLE ANCESTORS TO THE 
CAMBRIAN ANIMAL FORMS, EVEN IF BETTER-KNOWN EDIACARAN 
FORMS SUCH AS DICKINSONIA, CHARNIA, AND SPRIGGINA ARE 
PROBABLY NOT? DID THESE EXOTIC FORMS SOLVE THE MYSTERY 
OF THE CAMBRIAN EXPLOSION? 


Only a few years before my visit to the University of Oklahoma, I had written 
a scientific review article with research help from several colleagues, including a 
paleontologist and a marine biologist.*! (The latter was Paul Chien, who helped 
discover the Precambrian sponge embryos discussed in the previous chapter.) In 
our review article, I explained many of the problems with treating the Ediacaran 
as transitional intermediates discussed above. In the process of doing the 
research for that article, my colleagues and I encountered few paleontologists 
who thought that Parvancorina, Arkarua (see Fig. 4.2), or Vernanimalcula 
represented definitive ancestors of the Cambrian bilaterians, arthropods, or 
echinoderms. Could we have missed something? 

In fact, leading Cambrian authorities have dismissed associations between 
these odd fossil forms and the Cambrian animals. Nevertheless, in his talk before 
the showing of our film, the local professor from the University of Oklahoma 
asserted that the rather indistinct fossil form found in the Ediacaran Hills called 
Parvancorina represented a plausible ancestor of the arthropods. Some have 
described Parvancorina as a shield-shaped fossil form with a raised anchor- 
shaped ridge impressed atop it, bearing a superficial resemblance in its shape to 
that of a trilobite—thus, the claim that it might have represented an early 
arthropod. Yet leading Cambrian paleontologists dispute this association. 
Cambrian expert James Valentine has argued that Parvancorina is not 
convincing as an arthropod ancestor, and for good reason. Parvancorina fossils 
lack a head, jointed limbs, and compound eyes, all distinctive features of 
arthropods. Thus, Valentine noted that Parvancorina fossils “have not been 
shown to share derived features” with arthropods.*¢ 








FIGURE 4.2 

Figure 4.2a (left): Photograph of Arkarua fossil, courtesy Taylor & Francis, Ltd. 
Figure 4.2b (right): Photograph of Parvancorina fossil, courtesy Peterson, K. J., 
Cotton, J. A., Gehling, J. G., and Pisani, D., “The Ediacaran Emergence of 
Bilaterians: Congruence between the Genetic and the Geological Fossil 
Records,” Philosophical Transactions of the Royal Society B, 2008, 363 (1496): 
1435-43, Figure 2, by permission of the Royal Society. 


Valentine makes much the same point about the small disc-like imprint called 
Arkarua, one of the other Ediacaran forms cited by the University of Oklahoma 
professor that night at the Sam Noble Museum. Valentine points out that it too 
lacks many distinctive features of the animal phylum to which it is typically 
assigned. Indeed, those who propose Arkarua as an ancestor of the Cambrian 
animals usually claim that it represents an early echinoderm (as the professor in 
Oklahoma did). Echinoderms include starfish, sand dollars, and other animals 
with five-fold symmetry extending from a central body cavity.*? Some have 
perceived five tiny segmented divisions within the circular impressions left by 
Arkarua, making them seem roughly similar to some modern echinoderms. But 
that similarity has proven superficial at best. Other paleontologists observe that 
Arkarua lacks a stereom, or water vascular system, a definitive diagnostic 
feature of echinoderms; thus, its “echinoderm-specific features are not readily 
visible.”44 Valentine has argued that, absent such telltale features, the 
relationship of Arkarua to echinoderms “remains uncertain.”*° 


In the case of Vernanimalcula, the story is more complicated but equally 
problematic. Vernanimalcula is the name that Chinese paleontologists gave to an 
imprint in phosphorite sediment found in the Doushantuo Formation in 2004. 
They found the structure in 580-to 600-million-year-old rocks, making the 
impression even older than the Ediacaran strata. The paleontologist David 
Bottjer of the University of Southern California, and some Chinese 
paleontologists (at least, initially), speculated that the Vernanimalcula imprint 
might be the remains of an early bilaterian.*° 

Recall that bilaterians are animals whose parts found on one side of the body 
midline are also found in mirror image on the other (as opposed to, say, a 
radially symmetric animal*’). Figure 4.3 shows a picture of the structure of 
Vernanimalcula first found in the Doushantuo Phosphorite formation. Some 
paleontologists think that Vernanimalcula exhibits such bilateral symmetry and 
thus might be ancestral to the bilaterian animals that later first appeared in the 
Cambrian period. 

But problems have emerged with this argument. First, the form of the 
Vernanimalcula does not resemble any specific bilaterian animal. In addition, 
recent scientific analyses of these remains have questioned whether this imprint 
preserves the remains of animals and, therefore, bilaterians at all. For example, 
in 2004, Stefan Bengtson and Graham Budd, two paleontologists and Cambrian 
experts, published a detailed chemical and microscopic analysis of these fossils 
in the journal Science.*® They concluded that the structures preserved in 
phosphorite rocks had undergone significant alteration by so-called diagenesis 
and taphonomic processes. Diagenesis refers mainly to processes of chemical 
alteration that occur after sediments are deposited and before sedimentary rocks 
are fully hardened, or “lithified.” Taphonomic processes are those that alter once 
living organisms after burial and preservation in sediments. 





FIGURE 4.3 
Photograph of Vernanimalcula fossil. Courtesy American Association for the 
Advancement of Science, from Chen, J.-Y., Bottjer, D. J., Oliveri, P., Dornbos, 
S. Q., Gao, F., Ruffins, S., “Small Bilaterian Fossils from 40 to 55 Million Years 
Before the Cambrian,” Science, 305 (July 9, 2004): 218-22, Figure 1b. 
Reprinted with permission from AAAS. 


Based on their microscopic analysis, Bengtson and Budd rejected the 
hypothesis that these structures preserved the remains of an animal form. 
Instead, they argued that the phosphorite imprint exhibited distinctive features of 
the chemically altered remains of one-celled microfossils—microfossils that had 
been encrusted with layers of chemical residue from various diagenetic 
processes.*? 

More recently, in 2012, Bengtson and three other colleagues published 
another paper sharply critical of the view that Vernanimalcula represents an 
ancestor of the bilaterian animals—or even an animal. They show that the 
“structures key to animal identity are effects of mineralization that do not 
represent biological tissues.” For this reason they conclude: “There is no 
evidential basis for interpreting Vernanimalcula as an animal, let alone a 


bilaterian.”°? 


Though the paper was titled “A Merciful Death for the ‘Earliest Bilaterian,’ 
Vernanimalcula,” the authors were anything but merciful in wielding their 
arguments. Their article upbraided David J. Bottjer, the main paleontologist who 
has promoted the interpretation of Vernanimalcula as a bilaterian ancestor, for 
seeing what he wanted to see and disregarding the clear evidence of 
nonbiological mineralization. In a 2005 Scientific American article, Bottjer 
interpreted Vernanimalcula as the “oldest fossil animal with a bilaterian body 
plan yet discovered.” In that article, Bottjer claimed that Vernanimalcula 
confirmed the “suspicion that complex animals have a much deeper root in time” 
and “that the Cambrian was less of an explosion and more of a flowering of 
animal life.”°! After unequivocally rejecting Bottjer’s interpretation on the basis 
of their geochemical analysis, Bengtson and his coauthors rebuked Bottjer in 
rather personal terms: It is likely that the fossils referred to [as] Vernanimalcula 
were interpreted as bilaterians because this was . . . the explicit quarry of its 
authors. If you know from the beginning not only what you are looking for, but 
what you are going to find, you will find it, whether or not it exists. As Richard 
Feynman (1974) famously remarked: “The first principle is that you must not 
fool yourself—and you are the easiest person to fool.” . .. Once you have fooled 
yourself you will fool other scientists.°* 

Bengtson and his colleagues insist that however much a few paleontologists 
such as Bottjer might have wanted to “heap” “evolutionary significance” on 
Vernanimalcula in order to relieve their cognitive dissonance about the 
Cambrian explosion, the evidence does not bear the weight of interpretation that 
had been placed upon it. Thus, they conclude Vernanimalcula should be “laid to 
a merciful rest,” since the interpretation of it had “taken on a life of its own, a 
life it never had.”°? 


A DEEPER PROBLEM THOUGH BACK IN 2009, JONATHAN WELLS 
AND I DIDN’T KNOW ABOUT THE MOST RECENT CRITICAL 
ANALYSIS OF VERNANIMALCULA’S PRETENTIONS, WE DID KNOW 
THAT MANY LEADING PALEONTOLOGISTS HAD REJECTED 
ATTEMPTS TO IDENTIFY VERNANIMALCULA AS AN ANIMAL FORM. 
THUS, DURING THE QUESTION-AND-ANSWER PERIOD FOLLOWING 
THE FILM, JONATHAN WELLS EXPLAINED WHY THESE AND OTHER 
OBSCURE AND ENIGMATIC PRECAMBRIAN FOSSILS (OR IMPRINTS) 


FAILED TO QUALIFY AS CONVINCING PRECURSORS TO ANY OF THE 
CAMBRIAN ANIMALS, CITING THE WORK OF LEADING 
AUTHORITIES IN PALEONTOLOGY. IN EACH CASE, HE NOTED THAT 
SIMILARITIES BETWEEN THE EDIACARAN FORMS AND LATER 
CAMBRIAN ANIMALS HAD PROVEN SUPERFICIAL, BECAUSE THE 
EDIACARAN FORMS LACKED MANY KEY DIAGNOSTIC FEATURES OF 
SPECIFIC CAMBRIAN PHYLA. 


At the same time, as I later reflected on the lecture, I recognized a deeper 
problem with attempts to resolve the mystery of the Cambrian explosion by 
pointing to a few Precambrian fossils. Many defenders of the Darwinian picture 
of the history of life seemed to assume that the discovery of any alleged 
Precambrian animal forms, however implausible as ancestors of specific 
Cambrian animals or however sparsely distributed in the vast sequences of 
Precambrian strata, would solve the mystery of the Cambrian explosion, 
especially if these forms exemplified some abstractly perceived commonality 
such as bilateral symmetry. 

To see what’s wrong with this way of thinking, imagine an ambitious distance 
swimmer claiming that it would be possible to swim between California and 
Hawaii over a period of many months or years because of the small islands that 
provide way stations where he could eat, rest, and overnight at each stage along 
his marathon journey. But instead of showing that an archipelago dotting the 
route between California and Hawaii at reasonably accessible intervals actually 
exists, he points to a couple of barren atolls in the South Pacific far from the 
most likely course to Hawaii. Clearly, in that case, the claims of our intrepid 
hypothetical swimmer would not be credible. In a similar way, the claims of 
those who assert that a few isolated and anatomically enigmatic forms of life in 
the Precambrian solve the problem of the Cambrian explosion also lack 
credibility. 

To appreciate another aspect of this problem, let’s revisit the claims about 
Vernanimalcula as a possible ancestor of all the bilaterian phyla. On the one 
hand, for such a form to qualify as the ancestor common to a large number of 
specific phyla (such as the many bilaterian phyla that arise in the Cambrian), it 
must exhibit the basic bilaterian characteristics such as bilateral symmetry and 
what is called “triploblasty,” the presence of three distinct tissue layers 
(endoderm, mesoderm, and ectoderm). At the same time, a viable candidate for 
the role of common ancestor cannot by definition manifest any of the 


differentiating characteristics that distinguish the individual Cambrian phyla and 
their respective body plans from each other. For example, any bilaterian that 
manifests the characteristic exoskeleton of, say, an arthropod cannot also qualify 
as a plausible ancestor of a chordate, because chordates have internal skeletons 
or notochords. The logic of these distinct body designs precludes sharing both 
anatomical characteristics. For this reason, any hypothetical bilaterian common 
ancestor could only have existed as a kind of lowest anatomical common 
denominator, or what evolutionary biologists call a “ground plan,” having only 
those few features that are common to all of the animal forms that allegedly 
evolved from it. 

But this creates a dilemma. If a fossilized form is simple enough to qualify as 
the common ancestor of later highly differentiated bilaterian phyla, then it will 
necessarily lack most of the important distinguishing anatomical features of 
those specific phyla. That means that all the interesting anatomical novelties that 
differentiate one phylum from another must arise along the separate lineages 
branching out from the alleged common ancestor well after its origin in the fossil 
record. Heads, jointed limbs, compound eyes, guts, anuses, antennae, 
notochords, stereoms, lophophores (a tentacled feeding organ), and numerous 
other distinguishing characteristics of many different animals must come later on 
many distinct lines of descent. Yet the gradual evolutionary origin of these 
characteristics is not documented in the Precambrian fossil record. These 
characteristics do not appear until they arise suddenly in the Cambrian 
explosion. 

For this reason, indistinct fossils such as Vernanimalcula—even if we take 
them as representing a common ancestor of many bilaterians—document little of 
the Darwinian story of the history of animal life. Hugely significant gaps in the 
fossil record would still remain, because the Precambrian fossil record simply 
does not document the gradual emergence of the crucial distinguishing 
characteristics of the Cambrian animals. The important anatomical novelties that 
define the individual Cambrian phyla as well as their first clear representatives 
arise as suddenly as ever. 

To say that a form such as Vernanimalcula, or any of the other relatively 
indistinct Ediacaran forms, solves the problem of the missing Precambrian fossil 
record would be a bit like saying that a metal cylinder demonstrates all the steps 
involved in the construction of a toaster, automobile, submarine, or jet airplane 
simply because all these technological objects utilize “metal enclosures.” It’s 
true that each of these complex systems uses metal enclosures, but the presence 


of an enclosed metal surface is only a necessary, and not nearly a sufficient, 
condition of the origin of these various technological systems. Similarly, finding 
a simple but otherwise unadomed bilaterally symmetric form of life would 
hardly solve the problem of fossil discontinuity, because it would not by itself 
document the emergence of the unique characteristics of the individual bilaterian 
animals. 

This paradox is well known to paleontologists who work on the Cambrian 
radiation. Charles Marshall and James Valentine, for instance, describe the 
difficulty of attempting to characterize an “undiagnostic” group, by which they 
mean a possible ancestral “stem” group that lacks the specialized characteristics 
of its presumptive evolutionary progeny. They write: When trying to unravel the 
origins of the animal phyla . . . the hardest to examine is the phase between the 
actual cladogenic origin of a phylum and the time that it acquired its first 
phylum-specific characteristic(s). Even if we have fossils from this phase in a 
phylum’s history, we will not be able to prove their kinships at the level of 
phyla.*4 
Thus, even if Vernanimalcula, or some other fossil form, were simple enough 
and animal-like enough to qualify as a so-called ur (or original) form of animal 
life, it would paradoxically, for just that reason, be incapable of establishing 
itself as an unequivocal common ancestor of some specific Cambrian phylum. 

And there’s no relief in the other direction. If an alleged ancestral form 
manifests the distinguishing features of one of the specific Cambrian phyla—if, 
for example, Vernanimalcula or some other isolated form presented a convincing 
set of distinctively arthropod or chordate or echinoderm characteristics, then the 
very presence of those features would necessarily preclude the possibility of that 
specific animal form representing the common ancestor of all other Cambrian 
forms. The more an animal form manifests the characteristics of one phylum or 
group within the phylum, the less plausible it becomes as the ancestor of all the 
other animal phyla. 

And that is the dilemma in a nutshell. Highly differentiated and complex 
Precambrian forms by themselves could not have been ancestors common to all 
the Cambrian phyla; whereas undifferentiated forms simple enough to have been 
ancestral to all the Cambrian phyla leave no evidence, by themselves, of the 
gradual emergence of the complex anatomical novelties that define the Cambrian 
animals. Either way—whether the few alleged Precambrian ancestors are viewed 
as simple and relatively undifferentiated or complex and highly differentiated— 
the fossil record, given its otherwise pervasive pattern of discontinuity, does not 


establish the gradual evolution of numerous anatomical and morphological 
novelties. Instead, only a true series of transitional intermediates in which the 
fossil record documents both the existence of an original animal form and the 
gradual appearance of the key distinguishing anatomical features and novelties 
(and the Cambrian animals themselves) would remedy this deficiency. And yet 
that is precisely what the Precambrian fossil record has failed to document. 

As Graham Budd and _ Soren’ Jensen state, “The known 
[Precambrian/Cambrian] fossil record has not been misunderstood, and there are 
no convincing bilaterian candidates known from the fossil record until just 
before the beginning of the Cambrian (c. 543 Ma), even though there are 
plentiful sediments older than this that should reveal them.”°? Thus they 
conclude, “The expected Darwinian pattern of a deep fossil history of the 
bilaterians, potentially showing their gradual development, stretching hundreds 
of millions of years into the Precambrian, has singularly failed to materialize.”°© 


DILEMMA ON DISPLAY DURING THE QUESTION-AND-ANSWER 
SESSION THAT FOLLOWED THE SCREENING OF DARWIN’S DILEMMA, 
NONE OF THE UNIVERSITY OF OKLAHOMA PH.D. STUDENTS OR 
SCIENCE FACULTY WHO ATTENDED THE MUSEUM-SPONSORED 
LECTURE CHALLENGED MY COLLEAGUE JONATHAN WELLS WHEN 
HE EXPLAINED WHY LEADING PALEONTOLOGISTS DO NOT THINK 
THE EXOTIC PRECAMBRIAN FORMS CITED IN THE LECTURE WERE 
ANCESTORS OF CAMBRIAN FORMS. THIS NONREACTION SEEMED A 
LITTLE ODD GIVEN THE STRESS THE MUSEUM’S OWN EXPERT HAD 
LAID ON THESE CLAIMS, AND GIVEN THAT HE HAD MADE THESE 
CLAIMS RATHER EMPHATICALLY IN THE SAME BUILDING TO MANY 
OF THE SAME PEOPLE JUST THREE HOURS EARLIER. 


On our flight out of town the next day, Jonathan Wells told me something that 
cast our experience there in an even odder light. He’d had a chance to walk 
around the Sam Noble Science Museum after the lecture and before our event. 
He discovered that the museum has a display that vividly illustrates the severity 
of what we called Darwin’s dilemma. Wells recorded some of his observations 
on his return to Seattle. One part of the account of what he saw while touring the 
museum’s own display about the Cambrian explosion is worth quoting at length: 
[The display] seemed factually accurate for the most part, emphasizing (among 
other things) that many of the Cambrian explosion fossils were soft-bodied— 


which puts the lie to the common explanation that their precursors are absent 
from the fossil record because they lacked hard parts. The exhibit also made it 
clear that the Ediacaran fossils went extinct at the end of the pre-Cambrian, so 
(with a few possible exceptions) they could not have been ancestral to the 
Cambrian phyla. One particular panel in the exhibit caught my attention. It 
showed over a dozen of the Cambrian phyla at the top of a branching tree with a 
single trunk, but none of the branch points corresponded to a real living thing. 
Instead, the branch points were artificial technical categories such as 
“Ecdysozoa,” “Lophotrochozoa,” “Deuterostomia,” and “Bilateria.” The 
artificiality of the branch points emphasized that the branching-tree pattern 
imposed on the fossil evidence was itself an artificial construct. 

So, after all the controversy, it turned out that the museum that had sponsored 
the lecture denying Darwin’s Cambrian dilemma itself has an excellent display 
indicating that the expected ancestral forms of the Cambrian animals—the very 
ones that Darwin hoped to find a hundred and fifty years ago—are still missing 
from the Precambrian fossil record. But then why would the museum sponsor 
the presentation that it did? It’s hard to say, I suppose, but I’ve seen this dynamic 
before in discussions about Darwinian evolution. Evolutionary biologists will 
acknowledge problems to each other in scientific settings that they will deny or 
minimize in public, lest they aid and abet the dread “creationists” and others they 
see as advancing the cause of unreason. Perhaps just our presence on campus 
raising questions about contemporary Darwinism made them feel defensive on 
behalf of “science.” It’s an understandable, if ironic, human reaction, of course, 
but one that in the end deprives the public of access to what scientists actually 
know. It also perpetuates the impression of evolutionary biology as a science that 
has settled all the important questions at just the time when many new and 
exciting questions—about the origin of animal form, for example—are coming 
to the fore. 


5 
The Genes Tell the Story? 


Reconstructing the history of life has a lot in common with detective work. 
Neither detectives nor evolutionary biologists can directly observe the events in 
the past that interest them most. Detectives typically did not see the crime occur. 
Evolutionary biologists did not witness the origin of animals or other groups of 
organisms. Yet this limitation doesn’t mean either group of investigators lacks 
the evidence to determine with some confidence what happened. Detectives and 
evolutionary biologists as well as many other historical scientists— 
paleontologists, geologists, archeologists, cosmologists, and forensic scientists— 
do this regularly, based on careful inference from the clues or evidence left 
behind. 

Many evolutionary biologists have commented on the forensic nature of their 
work. Here’s how Richard Dawkins puts it: “I have used the metaphor of a 
detective, coming on the scene of the crime after it is all over and reconstructing 
from the surviving clues what must have happened.”! 

Perhaps the most obvious surviving traces of ancient life are fossils. But as 
evolutionary biologists and paleontologists have come to realize that the 
Precambrian fossil record has not furnished the confirmation that Darwin hoped 
it would, many have looked to other kinds of clues to establish the gradual 
emergence of Cambrian animal life from a common ancestor. 

In this effort, contemporary evolutionary biologists have followed the 
example of Darwin himself. Though Darwin argued that a general progression of 
simpler to more complex forms of life in the fossil record meshed nicely with his 
theory, he was acutely aware that the discontinuity of the fossil record, 
particularly as evidenced in Precambrian and Cambrian strata, did not. This is 
why he emphasized other types of evidence to establish his theory of universal 
common descent. 

In a famous chapter in On the Origin of Species titled “The Mutual Affinities 


of Organic Beings,” Darwin made his case not on the basis of the fossil 
evidence, but on the basis of similar anatomical structures in many distinct 
organisms. He noted, for example, that the forelimbs of frogs, horses, bats, 
humans, and many other vertebrates exhibited a common five-digit 
(“pentadactyl”) structure or organization (see Fig. 5.1). To explain such 
“homologies,” as he called them, Darwin posited a vertebrate ancestor that 
possessed pentadactyl limbs in rudimentary form. As a menagerie of modern 
vertebrates evolved from this common ancestor, each retained in its own way the 
basic pentadactyl mode of organization. For Darwin, his theory of descent with 
modification from a common ancestor explained these similarities better than the 
received view of many older nineteenth-century biologists such as Louis Agassiz 
or Richard Owen, both of whom thought homologies reflected the common 
design plan of a creative intelligence. 
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FIGURE 5.1 
The common five-digit pattern of the pentadactyl limb as manifested in four 
modern animals. Copyright Jody F. Sjogren. Used with permission. 


In reconstructing the evolutionary history of life, most evolutionary biologists 
today emphasize the importance of homology. They assume that similarities in 


anatomy and in the sequences of information-bearing biomacromolecules such 
as DNA, RNA, and protein point strongly to a common ancestor.* They also 
assume that the degree of difference in such cases is on average proportional to 
the time elapsed since the divergence from a common ancestor. The greater the 
difference in the common feature or molecular sequence, the farther back the 
ancestor from which the feature or sequence arose. 

Evolutionary biologists have used this approach to try to discern the 
evolutionary history of the Cambrian animals. If the Precambrian fossil record 
refuses to disclose the secrets of Precambrian evolution, so goes the thinking, 
perhaps the study of comparative anatomy and molecular homologies will. 
Given the well-established problem with the fossil evidence, many evolutionary 
biologists now particularly emphasize the importance of clues from molecular 
genetics. As evolutionary biologist Jerry Coyne, of the University of Chicago, 
notes, “Now we have a powerful, new, and independent way to establish 
ancestry: we can look directly at the genes themselves. By sequencing the DNA 
of various species and measuring how similar these sequences are, we can 
reconstruct their evolutionary relationships.”° 

There are two aspects of this endeavor. First, by analyzing the genes of 
existing animals representing phyla that first arose in the Cambrian, scientists 
have attempted to establish when the common ancestor of the Cambrian animal 
forms lived. This effort has generated what is known as the “deep-divergence 
hypothesis,” which holds that the common ancestor of all animal life arose long 
before the Cambrian explosion. Second, by analyzing anatomical and molecular 
similarities, biologists have attempted to reconstruct the Precambrian—Cambrian 
tree of life, mapping the course of evolution during a cryptic period before the 
Cambrian. 

Defenders of neo-Darwinism assert that these techniques have produced a 
coherent evolutionary picture of the early history of animal life. They assert that 
clues from the realm of genetics point unequivocally to Precambrian ancestral 
forms and to an evolutionary history that fossils have failed to document. 

This chapter will examine what genes tell us about the alleged universal 
common ancestor of all animals; the next chapter will consider whether the 
analysis of genes (and other features of organisms) yield a coherent treelike 
picture of the Precambrian prehistory of animal life. Genetic analyses have 
indeed revealed a trove of clues. The question is: Do those genetic clues 
establish the Precambrian ancestor and history that fossils have failed to 
document, or, as sometimes occurs in criminal investigations, has there been a 


rush to judgment? 


DEEP DIVERGENCE MANY PALEONTOLOGISTS AND 
EVOLUTIONARY BIOLOGISTS NOW CONCEDE THAT THE LONG- 
SOUGHT-AFTER PRECAMBRIAN FOSSILS, THOSE NECESSARY TO 
DOCUMENT A DARWINIAN ACCOUNT OF THE ORIGIN OF ANIMAL 
LIFE, ARE MISSING.* SCIENTISTS ARE ESPECIALLY CANDID ABOUT 
THIS WHEN ADDRESSING EACH OTHER IN THE TECHNICAL PEER- 
REVIEWED LITERATURE. OFTEN, HOWEVER, DEFENDERS OF 
EVOLUTIONARY ORTHODOXY RAISE ANOTHER POSSIBILITY—THAT 
THE COMMON ANCESTOR OF THE CAMBRIAN ANIMALS HAS BEEN 
DOCUMENTED AFTER ALL, NOT BY FOSSIL EVIDENCE, BUT BY 
MOLECULAR OR GENETIC EVIDENCE OF WHAT THEY CALL A “DEEP 
DIVERGENCE” OF ANIMAL LIFE. INMAKING SUCH CLAIMS, THESE 
BIOLOGISTS CLEARLY PRIVILEGE MOLECULAR EVIDENCE OVER 
EVIDENCE FROM THE FOSSIL RECORD. 


Proponents of deep divergence don’t deny that the fossil evidence has come 
up short. Instead, they adopt one of the versions of the artifact hypothesis to 
account for that missing evidence. They then argue that there was no “explosion” 
of animal forms in the Cambrian, but rather a “long fuse” of animal evolution 
and diversification lasting many millions of years leading up to what only looks 
like an “explosion” of animal life in the Cambrian, but this evolutionary history 
was hidden from the fossil record. Indeed, they argue that molecular evidence 
establishes a long period of undetected or cryptic evolution in Precambrian 
times, beginning from a common ancestor some 600 million to 1.2 billion years 
ago, depending upon which study of the molecular genetic data they cite. If 
correct, the Cambrian phyla may have had many hundreds of millions of years to 
evolve from a common ancestor.” 


THE MOLECULAR CLOCK ADVOCATES OF DEEP DIVERGENCE USE 
A METHOD OF ANALYSIS KNOWN AS THE “MOLECULAR CLOCK.” 
MOLECULAR CLOCK STUDIES ALSO ASSUME THAT THE EXTENT TO 
WHICH SEQUENCES DIFFER IN SIMILAR GENES IN TWO OR MORE 
ANIMALS REFLECTS THE AMOUNT OF TIME THAT HAS PASSED 
SINCE THOSE ANIMALS BEGAN TO EVOLVE FROM A COMMON 
ANCESTOR. A SMALL DIFFERENCE MEANS A SHORT TIME; A BIG 


DIFFERENCE, A LONG TIME. TO DETERMINE EXACTLY HOW SHORT 
OR LONG, THESE STUDIES ESTIMATE THE MUTATION RATE BY 
ANALYZING GENES IN TWO SPECIES OR TAXA THAT ARE THOUGHT 
TO HAVE EVOLVED FROM AN ANCESTOR WHOSE PRESENCE IN THE 
FOSSIL RECORD CAN BE DISCERNED AND DATED ACCURATELY. FOR 
EXAMPLE, MANY MOLECULAR-CLOCK STUDIES OF BIRDS AND 
MAMMALS ARE CALIBRATED BASED ON THE AGE OF AN EARLY 
REPTILE THOUGHT TO BE THE MOST RECENT COMMON ANCESTOR 
OF BOTH. 


THE 
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FIGURE 5.2 

The idea behind the molecular clock. The two animals and their homologous 
gene sequences at the right of the figure show the molecular distance between 
two present-day animals, that is, how many mutational differences have 
accumulated over time since they diverged on the tree of life. The animal at the 
left of the figure (a mammal-like reptile) represents the common ancestor from 
which these animals presumably evolved. Knowing how long ago the common 
ancestor (the mammal-like reptile) lived, and how many mutational differences 
have accumulated in its descendants during that time, allows scientists to 
calculate a mutation rate. In theory, once the mutation rate has been determined, 
it can be used to calculate the divergence time of other present-day species, after 
their homologous genes have been compared for differences. 


Genetic comparisons enable evolutionary biologists to estimate the number of 
mutational changes since divergence, and dating of the strata containing 
presumed fossil ancestors tells how long ago the divergence occurred. Assuming 
that different lineages evolve at the same rate,° together the two pieces of 
information enable evolutionary biologists to calculate a baseline mutation rate. 
They can then use that rate to determine how long ago some other pair of 
animals diverged from each other on the evolutionary tree (see Fig. 5.2).” 

Advocates of the deep-divergence hypothesis have applied this method to 
analyzing similar genes, RNA molecules, or proteins in pairs of animals 
belonging to phyla that first arose in the Cambrian period. In this way they 
estimate how long it took for the different animal phyla to diverge from a 
common Precambrian ancestor. 


DEEP AND DEEPER: EVIDENCE FOR DEEP DIVERGENCE IN THE 
1990S, EVOLUTIONARY BIOLOGISTS GREGORY A. WRAY, JEFFREY S. 
LEVINTON, AND LEO H. SHAPIRO PERFORMED A MAJOR STUDY OF 

CAMBRIAN-RELEVANT MOLECULAR SEQUENCE DATA. IN 1996, 
THEY PUBLISHED THEIR RESULTS IN A PAPER ENTITLED 
“MOLECULAR EVIDENCE FOR DEEP PRECAMBRIAN DIVERGENCES 
AMONG METAZOAN PHYLA.”® WRAY’S TEAM COMPARED THE 
DEGREE OF DIFFERENCE BETWEEN THE AMINO-ACID SEQUENCES 
OF SEVEN PROTEINS? DERIVED FROM SEVERAL DIFFERENT 
MODERN ANIMALS REPRESENTING FIVE CAMBRIAN PHYLA 
(ANNELIDS, ARTHROPODS, MOLLUSKS, CHORDATES, AND 

ECHINODERMS). THEY ALSO COMPARED THE NUCLEOTIDE BASE 
SEQUENCES OF A RIBOSOMAL RNA MOLECULE!” FROM THE SAME 

ANIMAL REPRESENTATIVES OF THE SAME FIVE PHYLA. 


The Wray study concluded that the common ancestor of the animal forms 
lived 1.2 billion years ago, implying that the Cambrian animals took some 700 
million years to evolve from this “deep-divergence” point before first appearing 
in the fossil record. Wray and his colleagues attempted to explain the absence of 
fossil ancestral forms during this period of time by postulating that Precambrian 
ancestors existed in exclusively soft-bodied forms, rendering their preservation 
unlikely. 

More recently, Douglas Erwin and several colleagues performed a study 
comparing the degree of sequence difference between other genes—seven 


nuclear housekeeping genes!! and three ribosomal RNA genes! across 113 


different species of living Metazoa. (The term “Metazoa” refers to animals with 
differentiated tissue. The term “metazoan” refers to one such animal or can be 
used as an adjective, as in “the metazoan phyla.”) They estimated that “the last 
common ancestor of all living animals arose nearly 800 million years ago.”!° 

Many similar studies affirm a very ancient or “stratigraphically deep” 
divergence of the animal forms, in opposition to those who claim that the 
Cambrian animals appeared suddenly within just a few million years.'* Each of 
these studies affirms the gradual emergence of animal life that most researchers 
expected to find on the basis of a Darwinian picture of the history of animal life. 
Indeed, a major aim of the Wray study was to challenge the view “that the 
animal phyla diverged in an ‘explosion’ near the beginning of the Cambrian 
period.”!° Wray and his colleagues argue instead that “all mean divergence time 
estimates between these four phyla and chordates, based on all seven genes, 
substantially predate the beginning of the Cambrian period.”!° They conclude: 
“Our results cast doubt on the prevailing notion that the animal phyla diverged 
explosively during the Cambrian or late Vendian, and instead suggest that there 
was an extended period of divergence during the mid-Proterozoic, commencing 
about a billion years ago.” !7 

From an orthodox Darwinian point of view, the conclusions of these studies 
seem almost unavoidable since (1) the neo-Darwinian mechanism requires vast 
amounts of time to produce anatomical novelty and (2) such phylogenetic 
analyses assume that all the animal forms descended from a common ancestor. 
Many evolutionary biologists claim that clues long hidden in DNA now confirm 
these Darwinian axioms and, consequently, the existence of an extremely 
ancient, Precambrian ancestor of the Cambrian animals. As Andrew Knoll, a 
Harvard paleontologist, states, “The idea that animals should have originated 
much earlier than we see them in the fossil record is almost inescapable.” !® 


REASONABLE DOUBT NEVERTHELESS, THERE IS NOW GOOD 
REASON TO DOUBT THIS ALLEGEDLY OVERWHELMING GENETIC 
EVIDENCE. IN THE IDIOM OF OUR FORENSIC METAPHOR, OTHER 

MATERIAL WITNESSES (FOSSILS) HAVE ALREADY COME FORWARD 
TO TESTIFY, THE TESTIMONY OF THE GENES (AND OTHER KEY 
INDICATORS OF BIOLOGICAL HISTORY) IS GROSSLY INCONSISTENT, 
AND THAT GENETIC TESTIMONY HAS COME TO US THROUGH A 


TRANSLATOR, WHO IS SHAPING THE WAY THE JURY PERCEIVES THE 
EVIDENCE. LET’S LOOK AT EACH OF THESE PROBLEMS IN TURN. 


Fossil Testimony Recall that the deep-divergence hypothesis has two 
components. One of them—a version of the artifact hypothesis—provides an 
explanation for why the Precambrian ancestral fossils are missing. And here the 
deep-divergence hypothesis first runs into trouble. As we saw in Chapter 3, there 
is no currently plausible version of the artifact hypothesis. The preservation of 
numerous soft-bodied Cambrian animals as well as Precambrian embryos and 
microorganisms undermines the idea of an extensive period of undetected soft- 
bodied evolution. In addition, the claim that exclusively soft-bodied ancestors 
preceded the hard-bodied Cambrian forms remains anatomically implausible. A 
brachiopod cannot survive without its shell. An arthropod cannot exist without 
its exoskeleton. Any plausible ancestor to such organisms would have likely left 
some hard body parts, yet none have been found in the Precambrian. Yet the 
deep-divergence hypothesis, whatever its other merits, requires a viable artifact 
hypothesis to explain the absence of fossilized Precambrian ancestors. 


The Testimony of Genes: Conflicting Stories There is a second, more telling 
reason to doubt the deep-divergence hypothesis: the results of different 
molecular studies have generated widely divergent results. Yet presumably there 
was only one common ancestor of all the Metazoa and only one ultimate 
divergence point. 


For example, comparing the Wray-led study and the Erwin-led study 
generates a difference of 400 million years. In the case of other studies, even 
greater differences emerge. Many other studies have thrown their own widely 
varying numbers into the ring, placing the common ancestor of animals 
anywhere between 100 million and 1.5 billion years before the Cambrian 
explosion (some molecular clock studies, oddly, even place the common 
ancestor of the animals after the Cambrian explosion).!? As Douglas Erwin, 
writing with fellow paleontologists James Valentine and David Jablonski, 
acknowledged in 1999: “Attempts to date those branching[s]” from a common 
Precambrian ancestor “by using molecular clocks have disagreed widely.”° 
How can this be? 

In the first place, different studies of different molecules generate widely 
divergent dates. In addition to the studies I have already cited, a 1997 paper by 


Japanese biologist Naruo Nikoh and colleagues examined two genes (aldolase 
and triose phosphate isomerase), and dated the split between eumetazoa and 
parazoa—animals with tissues (like cnidarians) from those without (like 
sponges)—at 940 million years ago.*! Compare that to a 1999 paper by Daniel 
Wang, Sudhir Kumar, and S. Blair Hedges based on the study of 50 different 
genes, showing that “the basal animal phyla (Porifera, Cnidaria, Ctenophora) 
diverged between about 1200-1500 Ma.” 

Sometimes contradictory divergence times are reported in the same article. 
For instance, a refreshingly forthright paper by evolutionary biologist Lindell 
Bromham of Australian National University and colleagues in Proceedings of 
the National Academy of Sciences USA analyzed two different molecules, 
mitochondrial DNA and 18S rRNA, to yield individual gene-based divergence 
dates that differed by as much as 1 billion years.** Another study investigating 
the divergence between arthropods and vertebrates found that depending on 
which gene was used, the divergence date might be anywhere between 274 
million and 1.6 billion years ago, the former date falling almost 250 million 
years after the Cambrian explosion.** That paper in its conclusion chose to split 
the difference, confidently reporting an arithmetic average of about 830 million 
years ago. Likewise, bioinformaticians Stéphane Aris-Brosou, now at the 
University of Ottawa, and Ziheng Yang, at University College London, found 
that depending on which genes and which estimation methods were employed, 
the last common ancestor of protostomes or deuterostomes (two broadly 
different types of Cambrian animals) might have lived anywhere between 452 
million and 2 billion years ago.” 

A survey of recent deep-divergence studies, by molecular evolutionists Dan 
Graur and William Martin, notes one study in which the authors claim to be 95 
percent certain that their divergence date for certain animal groups falls within a 
14.2-billion-year range—more than three times the age of the earth and clearly a 
meaningless result.2*° Graur and Martin conclude that many molecular-clock 
estimates “look deceptively precise,” but, given the nature of this field, their 
“advice to the reader is: whenever you see a time estimate in the evolutionary 
literature, demand uncertainty!”*’ The title of their paper, published in Trends in 
Genetics, made the point still more vividly: “Reading the Entrails of Chickens: 
Molecular Timescales of Evolution and the Illusion of Precision.” 

Sometimes even different studies of the same or similar groups of molecules 
have generated dramatically different divergence times. For example, Francisco 


Ayala and several colleagues have recalculated the divergence times of 
Metazoan phyla, using mostly the same protein-coding genes as Wray’s team.7® 
Correcting for “a host of statistical problems”*? in the Wray study, Ayala and 
colleagues found that their own estimates “are consistent with paleontological 
estimates”—not with the deep-divergence hypothesis. “Extrapolating to distant 
times from molecular evolutionary rates estimated within confined data-sets,” 
they conclude, is “fraught with danger.”°° Or as Valentine, Jablonski, and Erwin 
conclude, “The accuracy of the molecular clock is still problematical, at least for 
phylum divergences, for the estimates vary by some 800 million years depending 
upon the techniques or molecules used.”°! Reported Precambrian divergence 
times would vary even more dramatically were it not that evolutionary biologists 
and molecular taxonomists ignore certain molecules in their studies to avoid 
grossly contradictory results. Consider, for example, histones—proteins found in 
all eukaryotes involved in packing DNA into chromosomes. Histones exhibit 
little variation from one species to the next.** They are never used as molecular 
clocks. Why? Because the sequence differences between histones, assuming a 
mutation rate comparable to that of other proteins, would generate a divergence 
time at significant variance with those in studies of many other proteins.*° 
Specifically, the small differences between histones yield an extremely recent 
divergence, contrary to other studies. Evolutionary biologists typically exclude 
histones from consideration, because those times do not confirm preconceived 
ideas about what the Precambrian tree of life ought to look like. 

But that raises obvious questions. If we don’t have fossils documenting a 
common animal ancestor, and if genetic studies produce such different and 
contradictory divergence times, how do we know what the tree of life should 
look like and when the first animals began to diverge from a common ancestor? 
If histones change too slowly to provide an accurate calibration of the molecular 
clock, then which molecules do change at the correct rate—and how do we know 
that they do? The answer to these questions for most evolutionary biologists 
usually runs something like this. We already know that the animal phyla evolved 
from a common ancestor and we also know roughly when they did; therefore, we 
must reject studies based on histone sequences because the conclusions of these 
studies would contradict that date. 

But do we really know these things, and if so how? Assumptions about the 
window of time in which the first metazoan, the ancestor of all animals, must 
have lived are clearly not derived from the testimony of molecular genetics 
alone, since the results of sequence comparisons vary so greatly and include 


dates, depending upon the molecule studied, that fall outside that window. 
Instead, as one widely used textbook euphemistically puts it, evolutionary 
biologists must choose “phylogenetically informative” data.** By this, they mean 
sequences that exhibit neither too little nor too much variation—where too much 
and too little are determined by preconceived considerations of evolutionary 
plausibility, rather than by reference to independent criteria for determining the 
accuracy of molecular methods. 

The subjective quality of these conclusions, where scientists “cherry-pick” 
evidence that conforms to favored notions and discard the rest, casts further 
doubt on the extent to which molecular comparisons yield any clear historical 
signal. Only one divergence point could represent the actual universal common 
ancestor of all animals. If, however, comparative sequence analyses generate 
divergence times that are consistent with nearly all possible evolutionary 
histories, with the divergence event ranging from a few million to a few billion 
years ago, then clearly most of these possible histories must be wrong. They tell 
us little about the actual time of the Precambrian divergence, if such an event 
really happened. 


Questionable Assumptions Other problems run even deeper, having to do with 
the assumptions that make comparative sequence analyses possible in the first 
place. These comparisons assume the accuracy of molecular clocks—that 
mutation rates of organisms have remained relatively constant throughout 
geological time. These studies also assume, rather than demonstrate, the theory 
of universal common descent. Both assumptions are problematic. 


Even if we assume that mutation and natural selection (and other similarly 
undirected evolutionary processes) can account for the emergence of novel 
proteins and body plans, we cannot also assume that the protein molecular clock 
ticks at a constant rate. Unlike radiometric dating methods, molecular clocks 
depend on a host of contingent factors. As Valentine, Jablonski, and Erwin note, 
“Different genes in different clades evolve at different rates, different parts of 
genes evolve at different rates and, most importantly, rates within clades have 
changed over time.”*° So great is this variation that one paper in the journal 
Molecular Biology and Evolution cautions, “The rate of molecular evolution can 
vary considerably among different organisms, challenging the concept of the 
‘molecular clock.’ ”°° 

Keep in mind too that molecular clocks are calibrated based on the estimated 


age of presumably ancestral fossils. If, however, such estimates are incorrect by 
even a few million years, or if the fossil used to calibrate the mutation rate does 
not lie at the actual divergence point on the tree of life, the estimated mutation 
rate may be badly skewed. Calibration of molecular clocks depends on an 
accurate understanding of the ancestor-descendant relationships between fossils 
and their presumed descendant taxa. If the fossil used to calibrate the divergence 
time of two later groups was not actually a true ancestor, then the mutation-rate 
calculation based on that fossil’s age may be grossly inaccurate. As Andrew 
Smith and Kevin Peterson note: “Molecular clocks are not error-free and come 
with their own suite of problems. . . . The accuracy of the technique depends 
upon having an accurate calibration point or points, and a reliable phylogeny 
with correct branching order and branch-length estimates.”°’ Because these 
conditions are rarely met, “the idea that there is a universal molecular clock 
ticking away has long since been discredited.”?8 

Applying the molecular clock to dating the alleged Precambrian ancestor of 
the animals further complicates matters. Because there are so few fossils in the 
Precambrian and no clear ancestor-descendant lineages, the calibration of the 
molecular clock must be done on the basis of very different fossil lineages 
arising hundreds of millions of years later. Indeed, without evidence from the 
fossil record (older than 550 million years ago) with which to calibrate the 
molecular clock, any attempt to date the origin of the Cambrian animal phyla 
becomes highly questionable.*? Perhaps for this reason, Valentine, Jablonski, and 
Erwin have wondered whether “molecular clock dates can ever be applied 
reliably to such geologically remote events as Neoproterozoic branchings within 
the Metazoa.”4° (The Neoproterozic is the last Precambrian era.) These 
methodological problems may help account for the cacophony of conflicting 
results. 


SMUGGLING IN DARWIN A SECOND CRUCIAL ASSUMPTION 
BEHIND THE DEEP-DIVERGENCE HYPOTHESIS IS THE IDEA OF THE 
COMMON DESCENT OF ALL THE ANIMAL FORMS—LE., THAT ALL 
THE CAMBRIAN ANIMALS EVOLVED FROM A COMMON 
PRECAMBRIAN ANCESTOR. AS THE TEXTBOOK UNDERSTANDING 
BIOINFORMATICS ADMITS, “THE KEY ASSUMPTION MADE WHEN 
CONSTRUCTING A PHYLOGENETIC TREE FROM A SET OF 
SEQUENCES IS THAT THEY ARE ALL DERIVED FROM A SINGLE 


ANCESTRAL SEQUENCE, I.E., THEY ARE HOMOLOGOUS.”! OR AS 
THE HARVARD UNIVERSITY PRESS TEXTBOOK THE TREE OF LIFE 
STATES, “WE ARE OBLIGED TO ASSUME AT FIRST THAT, FOR EACH 
CHARACTER, SIMILAR STATES ARE HOMOLOGOUS,” WHEREBY 
“HOMOLOGOUS” THE TEXT MEANS CHARACTERS ARE SIMILAR 
BECAUSE THEY SHARE A COMMON ANCESTOR.“ 


This assumption (of universal common descent) raises the possibility that the 
ancestral entities represented by divergence points in these studies are artifacts of 
the assumptions by which molecular data are analyzed. Indeed, the computer 
programs that are used to compare molecular sequences have been written to 
produce trees showing common ancestors and branching relationships regardless 
of the extent to which the genes analyzed may or may not differ. Phylogenetic 
studies compare two or more gene sequences and then use degrees of difference 
to determine divergence points and nodes on a phylogenetic tree. Inherent in that 
procedure is the assumption that the nodes and divergence points existed in the 
past. 

Thus, the deep-divergence studies do not, in any rigorous sense, establish any 
Precambrian ancestral forms. Did a single, original metazoan or bilaterian 
ancestor of the Cambrian animals actually exist? The Precambrian—Cambrian 
fossil record taken on its face certainly doesn’t document such an entity. But 
neither do deep-divergence studies. Instead, these studies assume the existence 
of such ancestors, and then merely attempt, given that assumption, to determine 
how long ago such ancestors might have lived. One could argue that the 
conflicting divergence points do at least show that some common ancestor 
existed in the Precambrian, since, despite their conflicting results, all divergence 
studies indicate at least that. But, again, to invoke molecular studies that assume 
the existence of a common ancestor as evidence for such an entity only begs the 
question. Certainly it provides no reason for using molecular evidence to trump 
fossil evidence. Perhaps the Precambrian rocks do not record ancestors for the 
Cambrian animals because none existed. To foreclose that possibility, and to 
resolve the mystery of the missing Precambrian ancestral fossils, evolutionary 
biologists cannot use studies that assume the existence of the very entity their 
studies are thought to establish. 


THE “SHMOO”: A CATCH-22 REVISITED THE CONCEPT OF DEEP 
DIVERGENCE RAISES ANOTHER ISSUE RELATED TO MY DISCUSSION 


AT THE END OF THE PREVIOUS CHAPTER ABOUT WHAT WOULD BE 
REQUIRED TO DOCUMENT THE MISSING ANCESTRAL FORMS OF 
THE CAMBRIAN ANIMALS. RECALL THAT I ARGUED THERE THAT 
ANY PLAUSIBLE POSTULATED COMMON ANCESTOR TO ALL THE 
ANIMAL PHYLA MUST HAVE NECESSARILY LACKED MOST (OR ALL) 
OF THE SPECIFIC ANATOMICAL FEATURES THAT DISTINGUISH ONE 
PHYLUM FROM ANOTHER. THE MORE ARTHROPOD-LIKE A 
HYPOTHETICAL ANIMAL FORM, THE LESS PLAUSIBLE IT WOULD 
HAVE BEEN AS AN ANCESTOR TO THE CHORDATES, MOLLUSKS, 
ECHINODERMS, ANNELID WORMS, AND VICE VERSA. IN EACH 
CASE, THE DESIGN LOGIC AND ARRANGEMENT OF PARTS 
NECESSARY TO PROVIDE THE FOUNDATION FOR ONE MODE OF 
ANIMAL LIFE PRECLUDE IT FROM PROVIDING THE FOUNDATION 
FOR OTHER MODES OF ANIMAL LIFE—JUST AS A SYSTEM OF PARTS 
PROVIDING ONE MODE OF TRANSPORTATION (WITH A BICYCLE, 
FOR EXAMPLE) WILL TYPICALLY PRECLUDE FUNCTIONING AS 
ANOTHER (AS WITH A SUBMARINE, FOR EXAMPLE).*° 


For this reason, biologists thinking about the characteristics of the earliest 
ancestor of all the metazoan phyla—the actual animal at the deep-divergence 
point—have typically postulated an extremely simple form of life—what one 
evolutionary biologist described to me as a “shmoo,” after the blob-like cartoon 
character made famous by Al Capp in the 1940s and 1950s. Some have proposed 
that the ur-animal might have been something like a placozoan, a modern 
amorphous animal with only four cell types and no bilateral symmetry.“* Other 
paleontologists have mainly characterized the hypothetical ur-metazoan 
negatively, by reference to the characteristics that it must not have had to be a 
plausible common ancestral form to all other metazoa. (This need to characterize 
the ur-metazoan negatively has led some leading paleontologists to question 
whether the ur-animal can be described affirmatively with any specificity.) In 
any case, the need to characterize the ur-animal as an extremely simple “shmoo- 
like” form, lacking the numerous characteristics and anatomical novelties 
present in the Cambrian animals, highlights a deep dilemma for evolutionary 
theorists. On the one hand, to be plausible as a common ancestor of all the 
animal phyla, a hypothetical ur-metazoan must have few characteristics of later 
metazoan forms. Indeed, the more plausible the hypothetical ancestor, the 
simpler it must be, meaning it will lack more of the specific distinguishing 


features of the individual animal phyla. But that means any evolutionary 
scenario for the origin of the animals that postulates such a “stripped down” 
animal form as its starting point will need to envision those distinguishing 
characteristics arising later. And the fewer the number of characteristics in the 
hypothetical common ancestor, the more such characteristics will need to arise 
later. This logical requirement implies, in turn, the need for an even deeper 
divergence point in Precambrian history and the need for more time to produce 
these specific anatomical novelties—in turn, exacerbating the problem of fossil 
discontinuity. The more plausible the hypothetical common ancestor, the deeper 
the necessary divergence point and the greater the morphological discontinuity 
in the fossil record. 

On the other hand, proposing a more complex (and more anatomically 
differentiated) common ancestor closer in its affinities to some Cambrian animal 
forms, would eliminate the need for so deep a divergence point. Nevertheless, it 
would also diminish the plausibility of such a hypothetical ancestor as an ur- 
metazoan common to all the other Cambrian animals. Again, the more a 
hypothetical form resembles one of the specific animal forms or phyla, the less 
plausible it will be as an ancestor of all the others. And that is the dilemma. 
Could there have been an animal form simple enough to serve as a viable 
ancestor common to all the animal phyla? Perhaps. But positing such a form 
only deepens the required depth of the divergence point and intensifies the 
already significant problem of Precambrian—Cambrian fossil discontinuity. 


DEEP TROUBLE COMPARATIVE GENETIC ANALYSES DO NOT 
ESTABLISH A SINGLE DEEP-DIVERGENCE POINT, AND THUS DO NOT 
COMPENSATE FOR A LACK OF FOSSIL EVIDENCE FOR KEY 
CAMBRIAN ANCESTORS—SUCH AS THE UR-BILATERIAN OR UR- 
METAZOAN ANCESTOR. THE RESULTS OF DIFFERENT STUDIES 
DIVERGE TOO DRAMATICALLY TO BE CONCLUSIVE, OR EVEN 
MEANINGFUL; THE METHODS OF INFERRING DIVERGENCE POINTS 
ARE FRAUGHT WITH SUBJECTIVITY; AND THE WHOLE ENTERPRISE 
DEPENDS UPON A QUESTION-BEGGING LOGIC. MANY LEADING 
CAMBRIAN PALEONTOLOGISTS, AND EVEN SOME LEADING 
EVOLUTIONARY BIOLOGISTS, NOW EXPRESS SKEPTICISM ABOUT 
BOTH THE RESULTS AND THE SIGNIFICANCE OF DEEP-DIVERGENCE 
STUDIES. FOR EXAMPLE, SIMON CONWAY MORRIS HAS REJECTED 
THE IDEA THAT SUCH STUDIES SHOULD TRUMP FOSSIL EVIDENCE 


OF A MORE EXPLOSIVE, SHALLOW AND RAPID CAMBRIAN 
RADIATION. AFTER ASSESSING THE INCONSISTENT TRACK RECORD 
OF DEEP-DIVERGENCE STUDIES, HE CONCLUDES, “A DEEP HISTORY 
EXTENDING TO AN ORIGINATION IN EXCESS OF 1,000 MYR [MILLION 
YEARS] IS VERY UNLIKELY.”“° CONWAY MORRIS IS ONE OF SEVERAL 

LEADING EVOLUTIONARY BIOLOGISTS AND CAMBRIAN 
PALEONTOLOGISTS WHO HAVE EXPRESSED SKEPTICISM ABOUT 
THESE STUDIES.*” IN ANY CASE, THERE IS NOW LITTLE REASON TO 
REGARD THE DEEP-DIVERGENCE HYPOTHESIS AS A GENUINE 
SOLUTION TO THE CAMBRIAN CONUNDRUM. 


6 
The Animal Tree of Life 


In 2009, in honor of the bicentennial anniversary of Darwin’s birth, a piece of 
artwork was created to adorn the ceiling of an exhibit room at the Natural 
History Museum in London. A paper in the journal Archives of Natural History 
noted that the inspiration for the artwork, titled “TREE,” came from a diagram 
that Darwin had sketched in one of his notebooks—a diagram that later came to 
be known as the “tree of life” (see Fig. 2.11a). One BBC radio program called 
the TREE exhibit the “Darwinian Sistine Chapel.”! Another article in Archives 
of Natural History, a journal published by the University of Edinburgh, remarked 
that, “TREE celebrates Darwinian evolutionism” and “secular science and 
reason.”* 

For many biologists, the iconic image of Darwin’s tree of life represents 
perhaps the single best distillation of what the science of evolutionary biology 
has to teach, namely the “fact of evolution,’ apart from which “nothing in 
biology makes sense.”* Though the fossil record does not directly attest to many 
of the expected intermediate forms represented on Darwin’s tree, leading 
authorities assert that other lines of evidence, particularly from genetics, firmly 
establish Darwin’s tree as the correct picture of the history of life. 

In the previous chapter, we saw that there are many good reasons to doubt the 
deep-divergence hypothesis and its claim to have determined, based upon 
genetic evidence, the time at which the Cambrian animals began to evolve from 
specific Precambrian ancestors. Indeed, the idea that these studies can pinpoint 
when an ur-metazoan, or an ur-bilaterian, arose has engendered increasing 
skepticism among a growing number of evolutionary biologists and 
paleontologists. 

The tree of life as a whole, however, is another matter. Many evolutionary 
biologists think the case for universal common descent is something close to 
unassailable because, they argue, analysis of both anatomical and genetic 


similarities converges on the same basic pattern of descent from a universal 
common ancestor. As Richard Dawkins asserts, “when we look comparatively at 
... genetic sequences in all these different creatures—we find the same kind of 
hierarchical tree of resemblance. We find the same family tree—albeit much 
more thoroughly and convincingly laid out—as we did with . . . the whole 
pattern of anatomical resemblances throughout all the living kingdoms.”° 
Likewise, Jerry Coyne argues that gene sequences independently confirm the 
same set of evolutionary relationships—the same basic tree—established from 
the analysis of anatomy.° Oxford University chemist Peter Atkins is even more 
emphatic: “There is not a single instance of the molecular traces of change being 
inconsistent with our observations of whole organisms.”” 

As a result of this confidence, evolutionary biologists often dismiss the 
missing Precambrian fossil precursors and intermediates as a minor anomaly— 
one awaiting explanation by an otherwise completely adequate theory of the 
history of life. Because most evolutionary biologists are confident that a single 
continuous tree, with a single root, best represents the history of life—and 
explains so many other diverse facts of biology—they continue to think the same 
tree-like pattern also accurately describes the Cambrian explosion and the 
Precambrian history of animal life. Moreover, when evolutionary biologists 
reconstruct the phylogenetic history of a group (including animals), they 
typically do so in a time-independent manner. Their concern is usually to 
establish a relative order of branching along the tree of life, not to establish or 
“pinpoint” a series of absolute dates at which divergences occurred. Thus, 
although deep-divergence studies do not establish the existence of Precambrian 
animal ancestors for all the reasons argued in the previous chapter, the 
uncertainty surrounding the dates in these studies has not, for most evolutionary 
biologists, undermined their confidence in the overall tree-like pattern of animal 
life. Instead, many evolutionary biologists believe the strength of the case for the 
tree of life as a whole, based on other phylogenetic studies of similar genes and 
anatomical traits, indirectly establishes the existence of the missing evolutionary 
precursors of the Cambrian animals. As Coyne explains, “It stands to reason that 
if the history of life forms a tree, with all species originating from a single trunk, 
then one can find a common origin for every pair of twigs (existing species) by 
tracing each twig back through its branches until they intersect at the branch they 
have in common. This node, as we’ve seen, is their common ancestor.”® 

On the basis of similar logic, evolutionary biologists have typically assumed 
that what they think is true of all other forms of life is true of the Cambrian 


forms—that there must be a universal animal tree, the absence of fossil evidence, 
and the conflicting results of deep-divergence studies, notwithstanding. 

To assess the other evidence from genetics that supports this conclusion, it’s 
useful to review how the case for the universal animal tree is similar to the deep- 
divergence hypothesis and also how it is different. To establish both the fact and 
shape of the Darwinian animal tree of life, evolutionary biologists have long 
used methods that assume that both molecular sequences and anatomical 
similarities provide an accurate historical signal about the past. Like deep- 
divergence studies, these methods of “phylogenetic” reconstruction assume that 
the species or larger groupings (taxa) are related by descent from a common 
ancestor. (The term phylogeny, again, refers to the evolutionary history of a 
group of organisms. Thus, a “phylogenetic reconstruction” is an attempt to 
determine that history.) Such studies assume that the degree of difference 
between molecular or anatomical features in pairs of organisms indicates how 
long ago they diverged from a common ancestor. They also use independent 
calibrations of the molecular clock to calculate the exact divergence times.? 

Unlike deep-divergence studies, however, which attempt to establish just a 
single divergence time—such as that of the common ancestor of all the animal 
phyla—these more detailed phylogenetic studies seek to establish the contours of 
the Precambrian tree of animal life. This involves assessing degrees of 
relatedness among representatives of all the Cambrian phyla to establish 
multiple divergence points and times (nodes on the tree of life) as well as the 
relationships of the major Cambrian groups. 

Investigators employ these methods even in the absence of corroborating 
fossil evidence. In his textbook on fossils and evolution, following a full-page 
depiction of the discontinuous appearance of the Cambrian animals in the fossil 
record, Occidental College geologist Donald Prothero explains, “If the fossil 
record is poor in one particular group, we look to other sources of data.” He 
concludes that two such sources of data, anatomical and molecular data, now 
“converge on a common answer’—one “that is almost certainly ‘the truth’ (as 
much as we can use that term in science).”!° 

But is all of this true? Does analysis of the genetic and anatomical similarity 
of the Cambrian animals really establish that the history of animal life is best 
depicted as a continuously branching tree? Does the pattern of a branching tree 
accurately depict the history of Precambrian and Cambrian animal life, and in so 
doing establish the existence of Precambrian forms that the fossil record fails to 
document? 


THE PRECAMBRIAN AND CAMBRIAN TREE OF ANIMAL LIFE 


History happened once. And if Richard Dawkins is correct that “there is, after 
all, one true tree of life, the unique pattern of evolutionary branchings that 
actually happened,”!! then evolutionary history also happened once. 
Consequently, if we think of evolutionary trees describing the relationships of 
animal groups as hypotheses about an unobserved history (which is what they 
are), then having two or more conflicting hypotheses about only one history— 
the history that actually happened—means that we haven’t figured out what did 
happen. A widely used textbook on phylogenetic methods explains this: “The 
fact that there is only one true tree . . . provides the basis for testing alternative 
hypotheses. If two hypotheses are generated for the same group of species, then 
we can conclude that at least one of these hypotheses is false. Of course, it is 
possible that both are false and some other tree is true.”!¢ 

When a body of evidence supports multiple conflicting historical hypotheses, 
the evidence cannot be sending a definitive historical signal about what 
happened in the past. That raises the possibility that it may not be sending a 
signal at all. Conversely, when the evidence leads investigators to converge 
around a single historical hypothesis, when one hypothesis best explains a whole 
group of clues, it is much more likely that the evidence is telling us what actually 
happened. 

Consider, by way of illustration, a case in which we know a true history of 
ancestor—descendant relationships to see how evidence can converge around a 
single (unequivocal) history. Between 1839 and 1856, Charles Darwin and his 
wife, Emma, produced ten children, listed below in alphabetical order: 


Anne 


Charles 


Elizabeth 


Francis 


George 


Henrietta 


Horace 


Leonard 


Mary 


William 


This alphabetical listing, of course, is not their actual birth order. Instead, it is 
one of a large number of possible birth orders for Darwin’s children, only one of 
which is the correct sequence. Indeed, only one of these arrangements can 
represent the actual Darwin family history. 

Now, suppose I gave you and some friends a pile of historical evidence about 
Darwin’s children, and asked you to “solve for their birth order.” No one would 
consider the problem solved if you came back with more than one order. On the 
other hand, if you came back and presented a single coherent hypothesis of the 
birth order supported by evidence from birth records, family letters, and 
photographs from the Darwin family archives, that would provide persuasive 
evidence that you had obtained the correct solution. Since only one true history 
exists, once you find it the evidence will tend to fall naturally into place. 

But does the evidence for a Precambrian animal tree of life fall similarly into 
place or does it generate multiple conflicting histories? We’ve already seen that 
fossil evidence does not point to a specific Precambrian tree of animal life, or 
perhaps to any tree at all. We’ve also seen that genetic evidence by itself does 
not establish a single divergence point for animal evolution. But what about the 
genetic and anatomical evidence taken together? Does that evidence converge on 
a single history of animal life? If so, then it could well make up for a lack of 
fossil evidence. Otherwise, it would seem to raise an obvious question: Are the 
observed genetic and anatomical “affinities” among the Cambrian phyla sending 
reliable historical signals at all? 


CONFLICTING HISTORIES 


There are several reasons to doubt that evidence of genetic and anatomical 
similarity is sending a reliable signal of the early history of animal life. First, 
comparisons of different molecules frequently generate divergent trees. Second, 
comparisons of anatomical characteristics and molecules frequently produce 
divergent trees. Third, trees based solely on different anatomical characteristics 
often contradict each other. Let’s examine each problem. 


Molecules vs. Molecules 


Just as the molecular data do not point unequivocally to a single date for the last 
common ancestor of all the Cambrian animals (the point of deep divergence), 
they do not point unequivocally to a single coherent tree depicting the evolution 
of animals in the Precambrian. Numerous papers have noted the prevalence of 
contradictory trees based on evidence from molecular genetics. A 2009 paper in 
Trends in Ecology and Evolution notes that “evolutionary trees from different 
genes often have conflicting branching patterns.”!? Likewise, a 2012 paper in 
Biological Reviews acknowledges that “phylogenetic conflict is common, and 
frequently the norm rather than the exception.”!* Echoing these views, a January 
2009 cover story and review article in New Scientist observed that today the tree- 
of-life project “lies in tatters, torn to pieces by an onslaught of negative 
evidence.” As the article explains, “Many biologists now argue that the tree 
concept is obsolete and needs to be discarded,” because the evidence suggests 
that “the evolution of animals and plants isn’t exactly tree-like.” 

The New Scientist article cited a study by Michael Syvanen, a biologist at the 
University of California at Davis, who studied the relationships among several 
phyla that first arose in the Cambrian.!° Syvanen’s study compared two thousand 
genes in six animals spanning phyla as diverse as chordates, echinoderms, 
arthropods, and nematodes. His analysis yielded no consistent tree-like pattern. 
As the New Scientist reported, “In theory, he should have been able to use the 
gene sequences to construct an evolutionary tree showing the relationships 
between the six animals. He failed. The problem was that different genes told 
contradictory evolutionary stories.” Syvanen himself summarized the results in 
the bluntest of terms: “We’ve just annihilated the tree of life. It’s not a tree 
anymore, it’s a different topology [pattern of history] entirely. What would 


Darwin have made of that?”!® 

Other studies trying to clarify the evolutionary history and phylogenetic 
relationships of the animal phyla have encountered similar difficulties. 
Vanderbilt University molecular systematist Antonis Rokas is a leader among 
biologists using molecular data to study animal phylogenetic relationships. 
Nevertheless, he concedes that a century and a half after The Origin of Species, 
“a complete and accurate tree of life remains an elusive goal.”!” In 2005, during 
the course of an authoritative study he eventually copublished in Science, Rokas 
was confronted with this stark reality. The study had sought to determine the 
evolutionary history of the animal phyla by analyzing fifty genes across 
seventeen taxa. He hoped that a single dominant phylogenetic tree would 
emerge. Rokas and his team reported that “a 50-gene data matrix does not 
resolve relationships among most metazoan phyla” because it generated 
numerous conflicting phylogenies and historical signals. Their conclusion was 
candid: “Despite the amount of data and breadth of taxa analyzed, relationships 
among most metazoan phyla remained unresolved.” !® 

In a paper published the following year, Rokas and University of Wisconsin 
at Madison biologist Sean B. Carroll went so far as to assert that “certain critical 
parts of the TOL [tree of life] may be difficult to resolve, regardless of the 
quantity of conventional data available.”!° This problem applies specifically to 
the relationships of the animal phyla, where “[mJany recent studies have 
reported support for many alternative conflicting phylogenies.”*° Investigators 
studying the animal tree found that “a large fraction of single genes produce 
phylogenies of poor quality” such that in one case, a study “omitted 35% of 
single genes from their data matrix, because those genes produced phylogenies 
at odds with conventional wisdom.”*! Rokas and Carroll tried to explain the 
many contradictory trees by proposing that the animal phyla might have evolved 
too quickly for the genes to record some signal of phylogenetic relationships into 
the respective genomes. In their view, if the evolutionary process responsible for 
anatomical novelty works quickly enough, there would not be sufficient time for 
differences to accumulate in key molecular markers, in particular those used to 
infer evolutionary relationships in different animal phyla. Then, given enough 
time, whatever signal did exist might become lost. Thus, when groups of 
organisms branch rapidly and then evolve separately for long periods of time, 
this “can overwhelm the true historical signal”**—leading to the inability to 
determine evolutionary relationships. 


Their article brings the discussion of the Cambrian explosion full circle from 
an attempt to use genes to compensate for the absence of fossil evidence to the 
acknowledgment that genes do not convey any clear signal about the 
evolutionary relationships of the phyla first preserved by fossils in the Cambrian. 
The logic of their analysis also leads them to a strangely familiar conclusion. 
Since the analysis of key genetic markers—like the genes tracked in molecular- 
clock studies that presumably accumulate mutations at a constant rate—shows a 
low number of mutational differences between the Cambrian animal phyla, 
Rokas and Carroll conclude from specifically genetic evidence that the phyla 
must have diverged rapidly. As they put it in another paper, “Inferences from 
these two independent lines of evidence (molecules and fossils) support a view 
of the origin of Metazoa as a radiation compressed in time.”*° Thus, the inability 
to reconstruct the evolutionary history of the animal phyla from the molecular 
data not only fails to establish a Precambrian pattern of descent; it ironically also 
reaffirms the extreme rapidity of the origin of the Cambrian animal forms. 


Molecules vs. Anatomy 


In 1965, chemist Linus Pauling and biologist Emile Zuckerkandl, often hailed as 
the fathers of the molecular-clock concept, proposed a rigorous way to confirm 
evolutionary phylogenies. They suggested that if studies of comparative anatomy 
and DNA sequences generated similar phylogenetic trees, then “the best 
available single proof of the reality of macroevolution would be furnished.”** As 
they went on to explain, “only the theory of evolution . . . could reasonably 
account for such a congruence between lines of evidence obtained 
independently.”*° By focusing attention on these two independent lines of 
evidence and the possibility of their convergence (or conflict), Pauling and 
Zuckerkandl provided a clear and measurable way to test the neo-Darwinian 
thesis of universal common ancestry. 

And according to some scientists, studies of molecular homologies have 
confirmed expectations about the history of the animal phyla derived from 
studies of comparative anatomy. After citing Pauling and Zuckerkandl’s test, 
Douglas Theobald claims in his “29+ Evidences for Macroevolution” that “well- 
determined phylogenetic trees inferred from the independent evidence of 
morphology and molecular sequences match with an extremely high degree of 
statistical significance.”~° 

In reality, however, the technical literature tells a different story. Studies of 


molecular homologies often fail to confirm evolutionary trees depicting the 
history of the animal phyla derived from studies of comparative anatomy. 
Instead, during the 1990s, early into the revolution in molecular genetics, many 
studies began to show that phylogenetic trees derived from anatomy and those 
derived from molecules often contradicted each other. 

Probably the most protracted conflict of this type concerns a widely accepted 
phylogeny for the bilaterian animals. This classification scheme was originally 
the work of the influential American zoologist Libbie Hyman.*” Hyman’s view, 
generally known as the “Coelomata” hypothesis, was based on her analysis of 
anatomical characteristics, mainly germ (or primary tissue) layers, planes of 
body symmetry, and especially the presence or absence of a central body cavity 
called the “coelom,” which gives the hypothesis its name. In the Coelomata 
hypothesis, the bilaterian animals were classified in three groups, the 
Acoelomata, the Pseudocoelomata, and the Coelomata, each encompassing 
several different bilaterian animal phyla.~® (See Fig. 6.1a.) 

Then, in the mid-1990s, a very different arrangement of these animal groups 
was proposed based on the analysis of a molecule present in each (the 18S 
ribosomal RNA; see Fig. 6.1b). The team of researchers who proposed this 
arrangement published a groundbreaking paper in Nature with a title that 
surprised many morphologists: “Evidence for a Clade of Nematodes, Arthropods 
and Other Moulting Animals.”*? The paper noted the conventional wisdom, 
based on Hyman’s hypothesis, that arthropods and annelids were closely related 
because both phyla had segmented body plans.°” But their study of the 18S 
ribosomal RNA suggested a different grouping, one that placed arthropods close 
to nematodes within a group of animals that molt, which they called 
“Ecdysozoa.” This relationship surprised anatomists, since arthropods and 
nematodes don’t exactly look like kissing cousins. Arthropods (such as trilobites 
and insects) have coeloms, whereas nematodes (such as the tiny worm 
Caenorhabditis elegans) do not, leading many evolutionary biologists to believe 
nematodes were early branching animals only distantly related to arthropods.*! 
The Nature paper explained how unexpected this grouping of arthropods and 
nematodes was: “Considering the greatly differing morphologies, embryological 
features, and life histories of the molting animals, it was initially surprising that 
the ribosomal RNA tree should group them together.”°7 
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A. COELOMATA HYPOTHESIS B. ECDYSOZOA HYPOTHESIS 


FIGURE 6.1 

How scientists reconstruct evolutionary history depends on which similarities 
they regard as revealing the true history of descent (homology) and which 
similarities they regard as misleading (homoplasy). Advocates of the Coelomata 
hypothesis (Figure 6.1a) regard the coelom (body cavity) as a homologous 
feature. Thus, they think the presence of a coelom in both arthropods and 
vertebrates indicates a common ancestor that possessed a coelom (indicated by 
the solid horizontal line in 6.1a). But advocates of the Ecdysozoa hypothesis 
(6.1b) think the coelom evolved at least twice independently (indicated by the 
two dashed horizontal lines in Figure 6.1b). They regard the presence of the 
coelom as a historically misleading similarity—one that does not indicate the 
presence of that feature in the most recent common ancestor of the groups 
possessing it. These two hypotheses and their implied histories are not 
congruent, and cannot both be true. 





Since the Ecdysozoa hypothesis was first proposed, other scientists have 
vigorously opposed it, reaffirming the Coelomata hypothesis, based on the 
analysis of other molecular evidence.*? Advocates of the Ecdysozoa grouping 
pushed back hard, however,** contending that, properly interpreted, available 
genetic evidence supports the Ecdysozoa, not the Coelomata hypothesis.*° 

My point in summarizing these disputes is simply to note that the molecular 
and anatomical data commonly disagree, that one can find partisans on every 
side, that the debate is persistent and ongoing, and that, therefore, the statements 
of Dawkins, Coyne, and many others about all the evidence (molecular and 
anatomical) supporting a single, unambiguous animal tree are manifestly false. 


As can readily be seen by comparing Figures 6.1a and 6.1b,*° these hypotheses 
—Coelomata and Ecdysozoa—contradict each other. Although both might be 
false, both cannot be true. 

Various papers analyzing other groups have found similar discrepancies 
between molecular and morphological versions of the animal tree. A paper by 
Laura Maley and Charles Marshall in the journal Science noted, “Animal 
relationships derived from these new molecular data sometimes are very 
different from those implied by older, classical evaluations of morphology.”?” 
For example, when tarantulas were used as the representative of arthropods, the 
arthropods were grouped more closely to mollusks than to deuterostomes 
(animals that develop anuses first and mouths later). This makes sense because 
both mollusks and arthropods are protostomes (animals that develop mouths first 
and anuses later). But when brine shrimp were used as the representative of the 
arthropods, the arthropods became the odd man out. Now mollusks were 
grouped most closely to deuterostomes, far away from arthropods—a result 
clearly at odds with the conventional phylogeny based upon anatomical 
characteristics.°° 

Examples of similar conflicts abound. The traditional phylogeny placed 
sponges at the bottom of the animal tree, with progressively more complex phyla 
(e.g., cnidarians, flatworms, nematodes) branching off. But Valentine, Jablonski, 
and Erwin note that molecules “indicate a very different configuration” of the 
tree, where some higher deuterostome phyla branch off very early and some 
comparatively less complex phyla branch very late.°° 

Likewise, morphological studies suggest phoronids (see Fig. 3.6) and 
brachiopods (see Fig. 1.3), both marine filter-feeding animals, are 
deuterostomes, but molecular studies classify them within protostomes.*° 
Morphological studies typically imply that sponges are monophyletic (all part of 
an exclusive branch on the tree of life) because of their distinctive body 
architecture, but molecular studies suggest that sponges don’t belong to a single 
unified group, with some sponges more closely related to jellyfish than they are 
to other sponges.*! Cnidarians and ctenophores have similar body plans, leading 
many to expect they were closely related on the basis of morphology. But 
molecular data have distanced these phyla significantly.4* As a major review 
article in Nature in 2000 observes, “Evolutionary trees constructed by studying 
biological molecules often don’t resemble those drawn up from morphology.” 
And the problem isn’t getting better over time. A 2012 paper admits that larger 


datasets are not solving this problem: “Incongruence between phylogenies 
derived from morphological versus molecular analyses and between trees based 
on different subsets of molecular sequences has become pervasive as datasets 
have expanded rapidly in both characters and species.”“4 

Indeed, the widespread discrepancies between molecular data and 
morphological data and between various molecule-based trees have led some to 
conclude that Pauling and Zuckerkandl were wrong to assume that the degree of 
similarity indicates the degree of evolutionary relatedness.*° As Jeffrey H. 
Schwartz and Bruno Maresca put it in the journal Biological Theory: “This 
assumption derives from interpreting molecular similarity (or dissimilarity) 
between taxa in the context of a Darwinian model of continual and gradual 
change. Review of the history of molecular systematics and its claims in the 
context of molecular biology reveals that there is no basis for the ‘molecular 
assumption.’ ”4° 


Anatomy vs. Anatomy 


Attempts to infer a consistent picture of the history of animal life based on 
analyzing the anatomical characteristics of different animals have also proven 
problematic. In the first place, there is a general and long-standing problem with 
attempts to infer the evolutionary history of the animal phyla from similar 
anatomical traits. At the level of the phyla—that is, when one compares the 
phyla to each other and tries to determine their branching order—the number of 
shared anatomical characteristics available for inferring evolutionary 
relationships drops off quite dramatically. There is an obvious reason for this. 
For example, an anatomical character such as the “leg,” that is useful for 
diagnosing and comparing arthropods, which possess legs, proves useless for 
making comparisons between (for example) brachiopods or bryozoans, which do 
not. In the same way, basic structural features of human-designed systems, such 
as the distinctive submarine “trait” of an encapsulating watertight hull, might 
help to distinguish it from a cruise ship, which is only watertight on its 
underside. But this “trait” would be irrelevant for comparing and classifying, 
say, suspension bridges, motorcycles, or flat-screen televisions. In a similar 
manner, biologists find that there are only a handful of highly abstract characters, 
such as radial versus bilateral body symmetry, the number of fundamental tissue 
layers (triploblasty, three layers, versus diploblasty, two layers), or the type of 
body cavity present (true coelom, pseudocoelom, or no coelom), available for 


morphological comparisons of the many diverse animal forms. Yet evolutionary 
biologists have often disputed the validity of these rather abstract traits as guides 
to evolutionary history.” In addition, just as trees based upon the analysis of 
different sets of similar genes or proteins often conflict, trees constructed on the 
basis of different developmental and anatomical characteristics often conflict. 

When biologists construct phylogenetic trees based upon anatomical 
characteristics, they typically group the animal phyla according to the presence 
or absence of several key characteristics. For example, the standard version of 
the animal tree, based upon anatomy, groups animals according to their style of 
body plan symmetry and by their mode of body plan development. As noted 
earlier, animals with mirror symmetry along their vertical head-to-tail axes all 
fall within the Bilaterian group. Animals with radial symmetry (or no symmetry) 
fall outside this group. Within the Bilateria, taxonomists distinguish other main 
groups—protostomes and deuterostomes—based upon their differing modes of 
body plan development—.e., either “mouth first” or “anus first.” 

Yet a significant difficulty arises when evolutionary biologists consider how a 
particularly fundamental characteristic—the mode of germ-cell formation—is 
distributed among various groups on the canonical animal tree of life (see Fig. 
6.2).48 Germ cells produce eggs and sperm (in any sexually reproducing species) 
or gametes (in any asexually reproducing species), giving rise to the next 
generation.*? Animals have two main ways of generating germ cells. In one 
mode of germ-cell formation, known as preformation, cells inherit internal 
signals from a region within their own cell structure to become germ cells 
(illustrated by solid black squares on Fig. 6.3a). In the other main way of 
generating germ cells, known as epigenesis, germ cells receive external signals 
from surrounding tissues to become primordial germ cells (PGCs, illustrated by 
solid white squares in Fig. 6.3b). 

Germ-cell formation has indisputable evolutionary importance. To evolve, a 
population or a species must leave offspring; to leave offspring, species of 
animals must generate primordial germ cells. No PGCs, no reproduction; no 
reproduction, no evolution. 

One might expect, therefore, that if a group of animals is all derived from a 
common ancestor (with a particular mode of gamete production), then the mode 
of germ-cell formation should also be essentially the same from one animal 
species to the next in that group. Further, assuming the common ancestry of all 
animals, our expectation of homologous modes of germ-cell formation among 
the animals ought to be higher than for any other tissue type, cell line, or mode 


of development. Why? Because mutations affecting the developmental 
mechanisms that govern PGC formation inevitably disrupt successful 
reproduction.°” Again, if a species cannot reproduce, it cannot evolve.°! 

Thus, similar groups of animals—indeed, all animals, if they have descended 
from a common ancestor—ought to exhibit the same basic mode of germ-cell 
formation. Further, that the evolutionary tree derived from an analysis of the 
“mode of germ-cell formation” ought to be congruent with the trees derived 
from other such fundamental characteristics (such as body-plan symmetry, mode 
of development, number of primary tissues, and so forth). 

But the mode of germ-cell formation is nearly randomly distributed among 
the different animal groups, making it impossible to generate a coherent tree 
based on this characteristic, let alone making any comparison between such a 
tree and the canonical tree. Note also the distribution of the two basic modes of 
germ-cell development within the animal phyla as depicted on the canonical tree. 
Figure 6.4, derived from the work of Harvard developmental biologist Cassandra 
Extavour,°* shows this distribution and provides another way of understanding 
the incongruence that arises when analyzing different anatomical characteristics. 


FIGURE 6.2 


The canonical tree of the Metazoa as determined by the analysis of selected 
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A. PREFORMATION 


B. EPIGENESIS 


FIGURE 6.3 

Two modes of germ-cell formation. Figure 6.3a (left): In fruit flies as the egg is 
being formed, the mother’s nurse cells (the four cells on the left of the oval- 
shaped egg chamber, indicated by large black circles) deposit proteins and 
RNAs, which are transported to the posterior pole of the egg (indicated by the 
dark patch to the right of the large cell on the right). These maternally 
synthesized molecules then trigger the development of the germ cells and sex 
organs of the fly during embryogenesis. Figure 6.3b (right): In the eggs of mice 
there are no maternally deposited products that determine germ-cell formation. 
Rather, as the embryo develops a subpopulation of cells (represented by the 
triangles on the left) express “germline competence genes.” These cells then 
“read” signals that arrive from other tissues (see arrows), causing the cells to 
differentiate into primordial germ cells (as indicated by the stars on the right). 


Notice the two modes of germ-cell formation do not cluster together in 
separate parts of the canonical tree. Instead, they are distributed haphazardly 
among various phyla on different branches of the tree. In the protostomes, for 
instance, modes of germ-cell formation wink on and off between preformation 
and epigenesis. The same is the case within the deuterostomes: germ-cell 
formation varies almost randomly, and several groups exhibit both modes, 
rendering it difficult or impossible to determine which characteristic was present 
at different ancestral branching points. Noting this pattern of distribution, 
Cassandra Extavour concludes that “the data presently available cannot suggest 
homologies of the somatic components of metazoan gonads.”°>* 
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FIGURE 6.4 

The distribution of modes of primordial germ-cell formation (epigenesis, 
preformation, or both) among various animal groups. The solid thin lines 
between the boxes on the right and the phyla names on the left show where the 
different modes of germ-cell formation are present in different phyla. The nearly 
random distribution of types of germ-cell formation among the various animal 
groups makes it impossible to generate an animal phylogeny (evolutionary 
history) based upon this character that will match the evolutionary history 
implied by the canonical animal tree of life. 
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FIGURE 6.5 

A selection of incompatible (mutually incongruent) phylogenetic trees 
representing the history of the major groups of animals, drawn from the 
zoological and evolutionary literature (1940-present). Note: The definitions of 
some taxonomic groups in some of these phylogenetic trees may have changed 
significantly since the time those phylogenies were originally constructed. 
Branch lengths may not always be drawn to scale. 


After completing a survey of many such difficulties, University of St. 
Andrews zoologist Pat Willmer and Oxford University zoologist Peter Holland, 
experts on invertebrate anatomy, draw this conclusion: “Taken together, modern 
re-evaluations of traditional evidence support different and mutually exclusive 
subsets of [phylogenetic] relations.”°* They go on to observe that “patterns of 
symmetry, the number of germ layers in the body, the nature of the body cavity, 
and the presence or type of serial repetition [segmentation] have all been used to 


infer common ancestry.” But, they explain, the phylogenetic story these 
characteristics tell is “now either unacceptable or at least controversial” because 
the data are, at best, inconsistent.°° 

The historical record of ongoing uncertainty about the animal tree of life 
since 1859 confirms, as one respected textbook on invertebrate animals explains, 
that “phylogenetic analysis at the level of the phyla is highly problematical.”°° 
As a result, “the study of higher level animal phylogeny has yielded an 
expansive literature but relatively little detailed consensus. .. . In point of fact, 
there exists no such thing as ‘the traditional textbook phylogeny.’ A diversity of 
different schemes can be found.”°” To appreciate this problem visually, look at 
Figures 6.1a and 6.1b as well as Figure 6.5. These show some of the many 
metazoan phylogenies, based on anatomy, published in the twentieth century. 
These branching patterns plainly do not agree with one other. 


THE ASSUMPTIONS OF PHYLOGENETIC INFERENCE 


All these problems underscore several fundamental difficulties with the methods 
of phylogenetic reconstruction. When biologists analyze multiple anatomical 
traits or genes, the animal phyla consistently defy attempts to arrange them into 
the pattern of a single tree. Yet if there was a period of hidden Precambrian 
evolution, and if comparative sequence analyses reveal the actual history of 
animal life and, by implication, the existence of Precambrian animal forms, 
phylogenetic studies should converge more and more around a single tree of 
animal life. Just as only one possible divergence point could represent the event 
at which animal forms began to evolve from a common animal ancestor, only 
one of the many trees produced by phylogenetic analysis can represent the actual 
Precambrian history of animal life. If, instead, phylogenetic analyses 
consistently generate different possible evolutionary histories, it’s difficult to see 
how any one of them could be known to be sending a reliable historical signal. 
Again, the history of animal life only happened once. 

One could argue that these conflicting trees do, at least, show that some tree- 
like evolutionary pattern of common ancestry preceded the Cambrian, since all 
conflicting trees do affirm that. But, again, they all “show” this because they all 
presuppose it, not because they demonstrate it. 


CONVERGENT EVOLUTION 


There is yet another reason to wonder whether studies of anatomical or 
molecular homology convey anything definitive about the history of life. Many 
animals have single traits or features in common with otherwise decidedly 
different animals. In such cases, it makes no evolutionary sense to classify these 
forms as closely related ancestors. For example, moles and mole crickets have 
remarkably similar forelimbs, though moles are mammals and mole crickets are 
insects. No evolutionary biologist regards these two animals as closely related, 
for understandable reasons. 

The theory of universal common descent assumes that, generally, the more 
similar two organisms are, the more closely related they must be. Assuming 
common descent, animals with wildly differing body plans should not be closely 
related. The presence of nearly identical individual traits or structures within 
organisms exemplifying otherwise different body plans cannot, therefore, be 
attributed to evolution from a common ancestor. Instead, evolutionary biologists 
attribute similar traits or structures in such a context to so-called convergent 
evolution, the separate or independent origin of similar characters emerging on 
separate lines of descent after the point at which those lines diverged from their 
last common ancestor. Convergent evolution demonstrates that similarity does 
not always imply homology, or inheritance from a common ancestor. 

For this reason, the repeated need to posit convergent evolution (and other 
related mechanisms)’ casts further doubt on the method of phylogenetic 
reconstruction. Invoking convergent evolution negates the very logic of the 
argument from homology, which affirms that similarity implies common 
ancestry, except—we now learn—in those many, many cases when it does not. 
Repeatedly invoking convergence negates the assumption that justified the 
method of phylogenetic reconstruction in the first place, namely, that similarity 
is a reliable historical signal of common ancestry. 


A FAMILY REUNION? 


So what lesson should we draw from these many conflicting trees? Clearly, these 
contradictory results call into question the existence of a canonical tree of animal 
life. To see why, imagine being invited to an event billed as an extended-family 
reunion where you’ve been told you will encounter hundreds of your relatives, 


most of whom you have never met. Let’s call the description in the invitation the 
“reunion hypothesis.” The invitation says that a group photograph is planned, for 
which relatives will be grouped together according to their degree of relationship 
(first cousins with first cousins, and so on). 

You show up and grab a coleslaw, eager to meet these many previously 
unknown relatives. You see the hundreds gathered and have every reason to 
believe that the “reunion hypothesis” is true. After all, the invitation in your 
mailbox described the event as a family reunion. 

As the day goes on, however, something seems amiss. Here and there, you 
see familiar facial features—“Yes,” you think, that person could be my 
cousin”—but the majority of attendees and all the strangers you engage in 
conversation exhibit no discernible family resemblances. Nor does anyone seem 
to share any personal relationships with anyone else at the reunion, no matter 
how long they chat and try to establish points of commonality. What’s more, 
each person tells a different story of his or her family history. You try to group 
the strangers by physical characteristics (height, hair color, body type, and so 
on), but the characteristics that you find fail to yield evidence of family ties or 
genealogical connections. Whatever points of commonality you find refuse to 
assemble into a coherent, consistent story. The pedigrees are unclear. The 
reunion hypothesis is under major strain. 

When the photographer arrives, she gamely tries to draw everyone together 
for the planned photograph. Chaos ensues. No one knows where to stand, 
because the family relationships are so unclear, if they exist at all. After milling 
around hopelessly for half an hour or so, people depart for their cars, wondering 
why they bothered to attend the picnic, and indeed why they were invited in the 
first place. 

Do you now have good reasons to doubt the “reunion hypothesis”? You do. If 
the family reunion hypothesis were true, it would have been increasingly 
confirmed, because the evidence would have converged on a single consistent 
pattern. Had there been a true pattern of family relationships, the longer 
everyone talked, the more a single coherent pattern of relationship would have 
become apparent. But the people at the picnic were not your relatives, at least 
not in any way you could determine to be true. Instead of converging on a single 
pattern, the “evidence” was all over the map. 


A FOREST OF TREES 


Of course, my family reunion illustration breaks down as an analogy to the 
history of animal life, because if we could trace the history of all the people at 
the reunion back far enough we would find that they are all related by common 
ancestry. Though we can choose to assume that the same is true of the Cambrian 
animals, neither the fossil evidence nor the evidence of genetics and comparative 
anatomy actually establishes that. These three classes of evidence either provide 
no compelling evidence for Precambrian animal ancestors (in the case of fossils), 
or they provide question-begging and conflicting evidence (in the case of genes 
and anatomy). 

And that is the point of my story. Since there can be only one true history of 
the Cambrian animals, the evidence should converge on a common family tree— 
if indeed we are looking at evidence of true history. The picture given by the 
evidence should be stable, not constantly changing. But the evidence from a 
variety of quarters has instead continually generated new, conflicting, and 
incoherent pictures of the history of animal life. As with the “cousins” in my 
illustration, there seems to be no consistent and coherent way to organize the 
animal groups into a family tree. 

But if the genes don’t tell the story of Precambrian ancestral forms, if they 
don’t compensate for a dearth of fossil evidence and establish a unequivocal 
long cryptic history of animal life from an original animal, an ur-metazoan, then 
logically we are back to taking the fossil record at face value. In that case, the 
mystery of the missing ancestral fossils remains. If so, is there any way to 
explain the abrupt appearance of new forms of life in the fossil record within an 
evolutionary framework? During the 1970s, two young paleontologists thought 
that there just might be a way to do exactly that. 


7 
Punk Eek! 


Scientific discoveries are rarely made in Laundromats, but at least one great 
scientific breakthrough—an “aha” moment—occurred in one. The year was 
1968, more than a decade before the discovery of the first Chengjiang fossils. 
The scientist overtaken by the muses was paleontologist Niles Eldredge. One 
day while standing in a Michigan Laundromat, following months of collecting 
trilobite fossils for his Ph.D. research, Eldredge happened to reach into his 
pocket. He removed one of the fossils he had been collecting, a specimen of a 
trilobite species called Phacops rana. Initially, as he examined the specimen, he 
felt “depressed.” The fossil closely resembled many others that he had found 
across layers of strata during his fieldwork in the Midwest. His trilobites showed 
no evidence of gradual change, as classical neo-Darwinism had taught him to 
expect.! 

As Eldredge explained in a lecture at the University of Pittsburgh in 1983, he 
then experienced a kind of scientific epiphany. He realized that the “absence of 
change itself” was “a very interesting pattern.” Or as he later put it, “Stasis is 
data.”* “Stasis” is the term that Eldredge and his scientific collaborator, Stephen 
Jay Gould (see Fig. 7.1), later gave to the pattern in which most species, “during 
their geological history, either do not change in any appreciable way or else they 
fluctuate mildly in morphology, with no apparent direction.”’ As Eldredge 
examined that solitary trilobite, he realized that he had been observing evidence 
of stasis for some time—however much he might have wanted it otherwise. As 
he explained, “Stasis . .. was by far the most important pattern to emerge from 
all my staring at Phacops specimens.” He continued, “Traditionally seen as an 
artifact of a poor record, as the inability of paleontologists to find what 
evolutionary biologists going back to Darwin had told them must be there, stasis 
was, as Stephen Jay Gould put it, ‘paleontology’s trade secret-—an embarrassing 
one at that.”* 


This embarrassing realization proved pivotal, eventually leading Eldredge 
and Gould to reject both the gradualistic picture of evolutionary change 
articulated by Darwin and the neo-Darwinian understanding of the mechanism 
by which such change allegedly takes place. It also led them to formulate, in a 
series of scientific papers from 1972 to 1980, a new theory of evolution known 
as “punctuated equilibrium.”° 

As a consequence of this theory, neither Gould nor Eldredge expected to find 
a wealth of transitional intermediate forms in the fossil record. In their view, the 
main periods of biological innovation simply occurred too rapidly to leave many 
fossil intermediates behind.° 

Gould and Eldredge sought to explain the occurrence of the rapid periods’ of 
change (i.e., the punctuations) as the by-product of different kinds of 
evolutionary mechanisms or processes of change. They proposed, first, a 
mechanism called “allopatric speciation” to explain the rapid generation of new 
species. Gould, Eldredge, and another early advocate of punctuated equilibrium 
named Steven Stanley, a paleontologist at Johns Hopkins University, also 
proposed that natural selection operated on higher levels. Rather than natural 
selection favoring the fittest individual organisms within a species—as it does in 
classical Darwinism and neo-Darwinism—these paleontologists proposed that it 
often selected the most fit species among a group of competing species. Because 
they thought that speciation occurred more rapidly, and because they thought 
that natural selection acted on whole species and not just individual organisms, 
the advocates of punctuated equilibrium theorized that morphological change 
typically occurs in larger, more discrete jumps than Darwin first envisioned. 





FIGURE 7.1 
Figure 7.1a (left): Stephen Jay Gould. Courtesy Getty Images. Figure 7.1b 
(right): Niles Eldredge. Copyright © Julian Dufort 2011. Used with permission. 


Thus, in one sense, the theory of punctuated equilibrium, like the artifact 
hypothesis, sought to explain the absence of the transitional intermediate forms 
that were expected based on Darwin’s theory. By repudiating Darwinian 
gradualism, the advocates of punctuated equilibrium sought to account for the 
absence of transitional forms in the fossil record apart from the artifact 
hypothesis or, at best, using what they imagined as a more modest version of it. 
But in repudiating Darwinian gradualism, punctuated equilibrium also 
represented a radically different view of the pace and mode of evolution—a new 
theory of evolution that purported to identify a new mechanism of evolutionary 
change. As historian of science David Sepkoski explains, “Gould and Eldredge 
proposed a radical revision of this standard [neo-Darwinian] narrative. They 
argued that the pattern of evolutionary history really was composed of fits and 
Starts, consisting of long periods of evolutionary stasis (or ‘equilibrium’) 
‘punctuated’ by shorter periods of rapid speciation.”® 

During the 1970s and 1980s, the theory of punctuated equilibrium, or “punk 
eek” as it is affectionately known, generated both intense scientific debate and 
extensive media coverage.? Critics called the model “evolution by jerks,” 
leading Gould to reply that proponents of gradualism were offering “evolution 


by creeps.”!° Though initially Eldredge played more of a role in formulating the 
theory, Stephen Jay Gould emerged as the leading spokesman for it. As a result 
of his advocacy of the theory as well as his popular science writing, Gould 
achieved immense celebrity—celebrity that has in turn secured an enduring 
place for punctuated equilibrium in scientific awareness. 

So what has become of this bold scientific proposal? Does punctuated 
equilibrium solve the problems that traditional neo-Darwinism does not? Does it 
help explain the Cambrian explosion and the missing fossil intermediates that 
render it so mysterious? 


WANTED: FAST ENGINE ONCE THEY DECIDED TO TAKE THE FOSSIL 
RECORD AT FACE VALUE, THE QUESTION FOR GOULD AND 
ELDREDGE WAS OBVIOUS: WHAT COULD GENERATE 
EVOLUTIONARY CHANGE SO RAPIDLY? TO EXPLAIN THE SHORT 
BURSTS OR PUNCTUATIONS, GOULD AND ELDREDGE PROPOSED A 
MECHANISM OF RAPID SPECIATION TO WHICH STANLEY ADDED 
(WITH THEIR AGREEMENT) A NEW UNDERSTANDING OF THE 
MECHANISM OF NATURAL SELECTION.!! 


Whereas the neo-Darwinian mechanism of natural selection acting on random 
mutations necessarily acts slowly and gradually, Gould and Eldredge invoked a 
process called “allopatric speciation” to explain how new species might arise 
quickly. The prefix allo means “other” or “different,” and the suffix patric means 
“father.” Thus, allopatric speciation refers to processes that generate new species 
from separate parent (or “father”) populations. Allopatric speciation typically 
occurs when part of a population of organisms becomes geographically isolated 
—perhaps by the emergence of a mountain range or the shifting of a river’s 
course—from a larger parent population and then a daughter population changes 
in response to differing environmental pressures. 

Gould and Eldredge drew on insights from population genetics to explain 
why new genetic traits were more likely to spread and establish themselves 
within these smaller subpopulations. Population genetics, a subject to which I'll 
return in Chapter 12, describes the processes by which genetic traits change and 
become fixed in a population of organisms. It teaches that in typically large 
populations of organisms, it is difficult for a newly arising genetic trait to spread 
throughout an entire population. Yet for any evolutionary change to occur in a 
population, new genetic traits must become widespread, or “fixed,” by a process 


called “fixation.” 

In smaller populations, however, the probability of a newly arising trait 
becoming fixed is much higher, since the new trait needs to spread to fewer 
organisms. By way of illustration, consider a bag containing fifty red and fifty 
blue marbles. Suppose that, by removing individual marbles at random, you seek 
to change the “population” of mixed-color marbles to one in which all the 
marbles are red. To produce a completely red “species,” we must generate a 
population in which all the blue marbles have been eliminated. If someone 
randomly picks half of the one hundred marbles out of the bag, it is extremely 
improbable that all of the marbles thus eliminated will be of just one color. 
Indeed, there is less than 1 chance in 10°° that all of the marbles selected for 
removal will be blue.'* Conversely, there is an extremely high probability that 
the remaining batch will still include both blue and red marbles. 

In a much smaller group of, say, eight marbles, evenly divided between four 
red and four blue, the probability of selecting four blue marbles at random and 
leaving only red marbles, though unlikely, is not prohibitively small. There’s 
now a much higher chance—1 in 70—that the remaining population of marbles 
will be all red.!° By starting with a smaller number of marbles, the probability 
that random selection will result in a population of uniform color is much higher. 
In a similar way, the probability of fixing a genetic trait in a population of 
organisms decreases exponentially with the size of the population. 

In formulating punctuated equilibrium, Gould realized that new species 
would inevitably have to arise in smaller populations, where random processes 
could have a greater chance of fixing traits. Prominent among those random 
processes is one called genetic drift. This occurs when genetic changes spread or 
disappear randomly through a population, without regard for their effect on 
survival and reproduction. 

In Gould and Eldredge’s view, allopatric speciation helped to explain how 
evolution could occur in larger, more discrete jumps than Darwinian gradualism 
predicts (see Fig. 7.2). As allopatric speciation occurs, it can generate what 
Gould and Eldredge conceived as sibling or offspring species. They thought that 
the processes that drive these speciation events occur relatively quickly in 
smaller populations, thus helping to explain the sudden jumps in the fossil 
record. As they put it: “Small numbers and rapid evolution virtually preclude the 
preservation of speciation events in the fossil record.”!* As they envisioned the 
evolutionary process working, the branches on the tree of life would split off so 
abruptly that they would appear as virtually “horizontal” lines, producing sudden 


discontinuities in the fossil record and therefore fewer fossilized intermediates. 
Eldredge and Gould explained it this way: “The theory of allopatric (or 
geographic) speciation suggests a different interpretation of paleontological data. 
If new species arise very rapidly in small, peripherally isolated populations, then 
the expectation of insensibly graded fossils is a chimera. A new species does not 
evolve in the area of its ancestors; it does not arise from the slow transformation 
of all its forbears.” Thus, they concluded, “Many breaks in the fossil record are 





real.” !° 
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Two views of the history of life. Figure 7.2a (left): The traditional Darwinian 
picture showing slow, gradual change. Figure 7.2b (right): The history of life as 
depicted by the theory of punctuated equilibrium showing rapid speciation. 


Gould, Eldredge, and Stanley thought that members of these sibling or 
offspring species would, subsequent to their origin by allopatric speciation, 
compete against each other for resources and survival, just as, in neo-Darwinism, 
individual organisms or siblings may compete to survive and reproduce within a 
population. In their view, if members of one species succeed over another 
because of some selective advantages they possess, then that species will survive 
and predominate, passing on its traits. This process of interspecies or 
interpopulation competition (as opposed to intraspecies competition) Gould 


called “species selection.” !® 


As Gould himself explained: “I propose, as the central proposition of 
macroevolution, that species play the same role of fundamental individual that 
organisms assume in microevolution. Species represent the basic units in 
theories and mechanisms of macroevolutionary change.”!’ Since natural 
selection then would act upon large differences in overall biological form— 
differences between whole species as opposed to individuals within species— 
evolutionary change would take place in bigger, more discrete jumps.!® 

Gould and Eldredge thus did not expect the fossil record to document many 
intermediates. Instead, they thought the “gaps” in the fossil record were “the 
logical and expected result of the allopatric model of speciation” as well as the 
closely related mechanism of species selection.!° Species selection made “the 
species” the unit of selection; the allopatric model of speciation asserted that 
new species quickly arise from smaller populations of organisms. Both 
mechanisms implied that fewer fossil intermediates would be _ preserved. 
According to punctuated equilibrium, the long “missing” transitional 
intermediates are not missing after all. In the process of species selection, the 
species, rather than the individual organism, competes for survival and thus 
becomes—in the jargon of evolutionary biology—the main “unit of selection” in 
macroevolution. 


“PUNK EEK” AND THE FOSSIL RECORD ELDREDGE AND GOULD 
DEVISED THE THEORY OF PUNCTUATED EQUILIBRIUM TO 
ELIMINATE THE CONFLICT BETWEEN THE FOSSIL RECORD AND 
EVOLUTIONARY THEORY. NEVERTHELESS, PUNCTUATED 
EQUILIBRIUM HAS ITS OWN PROBLEMS ACCOUNTING FOR THE 
FOSSIL RECORD. IN PARTICULAR, THE PATTERN OF FOSSIL 
APPEARANCE IN THE CAMBRIAN PERIOD IS INCONSISTENT WITH 
BOTH THE WAY IN WHICH PUNCTUATED EQUILIBRIUM DEPICTS THE 
HISTORY OF LIFE AND WITH THE IDEA THAT ALLOPATRIC 
SPECIATION AND SPECIES SELECTION ARE RESPONSIBLE FOR THAT 
PATTERN. THERE ARE SEVERAL REASONS FOR THIS. 


First, the top-down pattern of appearance of Cambrian animal forms that we 
saw in Chapter 2 contradicts punctuated equilibrium’s depiction of the history of 
life almost as much as it does the Darwinian picture (see Fig. 2.11). Recall that 
Darwin thought that the first representatives of the higher taxonomic categories 


emerged after the first appearance of representatives of each of the lower level 
taxa—that small differences distinguishing, for example, one species from 
another should gradually accumulate until they produced organisms different 
enough to be classified, first, as different genera, then, as different families, and 
eventually as different orders, classes, and so on. Instead, the first Cambrian 
animal forms are different enough from each other to justify classifying them as 
separate classes, subphyla, and phyla from their first appearance in the fossil 
record (see Fig. 7.3). 

This pattern creates an acute difficulty for the theory of punctuated 
equilibrium., First, due to the action of allopatric speciation and species 
selection, advocates of punctuated equilibrium envision morphological change 
(represented as horizontal distance in Figure 7.2) arising in larger, more 
discontinuous increments of change. Nevertheless, like neo-Darwinists, they too 
see phyla-level differences arising from the “bottom up,” starting with lower 
level taxonomic differences—albeit occurring in increments involving whole 
new species rather than individuals or varieties within species. Indeed, according 
to the theory of punctuated equilibrium, allopatric speciation first produces new 
species in smaller geographically isolated populations. For representatives of 
higher taxonomic categories to arise, these new species must accumulate new 
traits and evolve further. For this reason, punctuated equilibrium also expects 
small-scale diversity and differentiation of new species to precede the emergence 
of larger-scale morphological disparity and taxonomic differences. It also 
expects a “bottom-up” rather than a “top-down” pattern of appearance (see Fig. 
7.3). 

Second, for species selection to produce many new species, such as those that 
arise in the Cambrian explosion, a large pool of different species must first exist. 
The Precambrian fossil record does not document, however, the existence of 
such a large and diverse pool of competing Precambrian species upon which 
species selection (via allopatric speciation) might operate. 

Paleontologists Douglas Erwin and James Valentine exposed this problem in 
1987 in a seminal paper titled “Interpreting Great Developmental Experiments: 
The Fossil Record.”?? They questioned the ability of both of the main 
evolutionary theories of the time—punctuated equilibrium and neo-Darwinism 
—to explain the pattern of fossil appearance in the Precambrian—Cambrian fossil 
record.*! Clearly, neo-Darwinism does not explain this pattern. But, as Valentine 
and Erwin argue, neither does punctuated equilibrium. As they conclude, the 
mechanism of species selection requires a large pool of species upon which to 


act. Thus, Valentine and Erwin conclude: “The probability that species selection 
































is a general solution to the origin of higher taxa is not great.”*? 
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This figure shows that the theory of punctuated equilibrium (on left), like neo- 
Darwinism, anticipates small-scale diversity preceding large-scale disparity in 
form in contrast to the pattern in the fossil record (shown on right). Punctuated 
equilibrium also anticipates a “bottom-up,” rather than a “top-down,” pattern. 


The late-Precambrian and Cambrian fossil records present another difficulty 
for punctuated equilibrium. Though Gould and Eldredge envisioned new traits 
becoming fixed in small isolated populations where speciation eventually occurs, 
they envisioned these traits first arising during periods of stasis in the large 
populations from which the smaller populations later separated. Gould realized 
that only stable large populations would afford enough opportunities for 
mutations to generate the new traits that macroevolution requires.*? At the same 
time, he recognized that these new traits would have a far greater chance of 
being fixed into small, isolated populations where the random loss of some traits 
makes the fixation of others more likely (recall the marble example). By relying 
on large populations to generate new traits and small populations to fix them 
throughout a population, Gould wanted to provide both a plausible (if finely 
tuned) mechanism to explain both macroevolutionary change and the absence of 


fossil intermediates.** The late University of Chicago paleontologist Thomas J. 
M. Schopf described the balance this way, under punctuated equilibrium, 
evolution proceeds “in populations large enough to be reasonably variable, but 
small enough to permit large changes in gene frequencies due to random drift.”?° 

But by relying on the accumulation of new traits within large parent 
populations, Gould undercut his own rationale for concluding that the fossil 
record should not preserve many intermediate forms. The reason for this is 
obvious: if novel genetic traits arise and spread within a large population of 
organisms, they are more likely to leave behind fossil evidence of their 
existence. Organisms with new and unique combinations or mosaics of traits 
represent nothing less than new forms of life. Thus, the process by which Gould 
envisions new genetic traits arising in large populations implies that new forms 
of life—some presumably transitional to other forms—should be preserved in 
the fossil record. Yet the Precambrian fossil record fails to preserve such a 
wealth of biological experiments during the long periods of relative stability in 
large populations that Gould’s theory envisions. 


THE TESTIMONY OF STATISTICAL PALEONTOLOGY STUDIES IN 
STATISTICAL PALEONTOLOGY HAVE RAISED ADDITIONAL 
QUESTIONS ABOUT WHETHER THE FOSSIL RECORD DOCUMENTS 
ENOUGH TRANSITIONAL INTERMEDIATES TO RENDER 
PUNCTUATED EQUILIBRIUM CREDIBLE. IN CHAPTER 3, I DISCUSSED 
THE WORK OF THE STATISTICAL PALEONTOLOGIST MICHAEL 
FOOTE. FOOTE USED SAMPLING THEORY TO ARGUE THAT THE 
FOSSIL RECORD PROVIDES A REASONABLY COMPLETE PICTURE OF 
THE FORMS OF LIFE THAT HAVE EXISTED ON EARTH AND TO 
SUGGEST THAT PALEONTOLOGISTS ARE UNLIKELY TO FIND THE 
MANY INTERMEDIATE FORMS THAT NEO-DARWINIAN THEORY 
REQUIRES. 


Foote has also analyzed the question of whether the fossil record documents 
the number of intermediate forms that punctuated equilibrium requires. His 
answer: it depends. 

Foote notes that whether a particular version of evolutionary theory can 
account for patterns in the fossil record depends upon the kind of mechanism of 
change it invokes. Neo-Darwinism relies upon a slow, gradually acting 
mechanism of change and, thus, has difficulty accounting for evidence of sudden 


appearance. Foote analyzes whether the number of proposed transitional species 
in the fossil record is consistent with punctuated equilibrium and concludes that 
it depends upon how fast the mechanisms upon which it relies can generate new 
forms of life. Though its proponents envision the evolution of new forms of life 
arising more abruptly than do the neo-Darwinists, they would still expect that the 
fossil record should have preserved some transitional fossils. Does the fossil 
record preserve even the relatively few intermediate forms that the theory of 
punctuated equilibrium implies that it should? 

To answer that question, Foote developed a statistical method of testing the 
adequacy of different evolutionary models against several variables.2° He 
observed that for punctuated equilibrium to succeed as an explanation for the 
data of the fossil record, it needs a mechanism capable of producing major 
evolutionary change quickly, because only such fast-acting change could account 
for the relative paucity of transitional forms in the fossil record. As Foote 
explained (writing with Gould in fact), the adequacy of punctuated equilibrium 
as an account of the fossil record depends upon the existence of a mechanism “of 
unusual speed and flexibility.” 

But does the theory identify such a mechanism? 


FROM WHENCE NEW TRAITS AND FORM? 


Neither allopatric speciation nor species selection can generate the new genetic 
and anatomical traits necessary to produce animal forms, let alone in the 
relatively brief time of the Cambrian explosion. As conceived by Gould and the 
other advocates of punctuated equilibrium, allopatric speciation just allows for 
the possibility of the rapid fixation of preexisting traits, not the generation of 
new traits. When a parent population splits into two or more daughter 
populations, each of the daughter populations retains a part, but usually not the 
whole, of the gene pool of the original population. No new genetic traits are 
generated by the geographical isolation of one part of a population from another. 

It could be argued, of course, that mutations might occur during the process 
of speciation, thus generating new genetic traits. But as Gould and Eldredge 
conceived of it, allopatric speciation occurs much too rapidly to have a 
reasonable chance of mutations generating anything fundamentally new. Darwin 
recognized in On the Origin of Species that evolution is a numbers game: larger 
population sizes and more generations offer more opportunities for favorable 
new variations to arise. As he explained: “Forms existing in larger numbers will 


always have a better chance . . . of presenting further favourable variations for 
natural selection to seize on, than will the rarer forms which exist in lesser 
numbers.”“° Yet for the mechanism of allopatric speciation to generate new 
traits, it would need to generate significant changes in form in small 
“peripherally isolated” populations over relatively few generations.*? Because of 
these constraints, many biologists have concluded that allopatric speciation 
requires too much change too quickly to provide the theory of punctuated 
equilibrium with a biologically plausible mechanism for producing new traits or 
forms of animal life. 

And that is why Gould and Eldredge, especially in their later formulations of 
the theory, envisioned new traits arising during long periods of stasis in larger 
populations rather than during short bursts of speciation. But a process in which 
traits arise “during long periods of stasis” does not constitute a “mechanism of 
unusual speed and flexibility,” though that is precisely what, according to Gould 
and Foote, punctuated equilibrium requires in order to explain the abrupt 
appearance of new animal forms. 

If allopatric speciation does not produce a fast-acting trait-generating 
mechanism, does species selection? Again, the answer is no. Species selection 
does not account for the origin of the different anatomical traits that distinguish 
one species from another. Species selection, as conceived by the proponents of 
punctuated equilibrium, acts on species and traits that already exist. Indeed, 
when Stanley, Gould, and Eldredge envisioned natural selection acting to favor 
the most fit species over another in a competition for survival, they presupposed 
the existence of a pool of different species and, therefore, also the existence of 
some mechanism for producing the traits that characterize those different 
species. That mechanism, however, would necessarily need to generate those 
differentiating traits before species could enter into competition with each other. 
Species selection eliminates less fit species in a competition for survival; it does 
not generate the traits that distinguish species and establish the basis for 
interspecies competition. 

So where do these traits come from? When pressed, Gould eventually 
acknowledged that the origin of anatomical traits themselves result from good, 
old-fashioned natural selection acting on random mutations and variations—that 
is, from the neo-Darwinian mechanism acting over long periods of time on large 
relatively stable populations. But that meant that punctuated equilibrium, to the 
extent it relies on mutation and natural selection, is subject to the same evidential 
and theoretical problems as neo-Darwinism. And one of those problems is that 


the neo-Darwinian mechanism does not act quickly enough to account for the 
explosive appearance of new fossil forms in the Cambrian period. Like allopatric 
speciation, species selection does not qualify as the kind of rapid and flexible 
mechanism that Gould elsewhere insisted his theory must have in order to 
explain the abrupt appearance of animal forms in the fossil record. 


NOVEL FORM AND MECHANISM AN EVEN MORE PROFOUND 
DIFFICULTY WITH PUNCTUATED EQUILIBRIUM AS AN 
EXPLANATION FOR THE CAMBRIAN EXPLOSION REMAINS. NEITHER 
SPECIES SELECTION NOR ALLOPATRIC SPECIATION EXPLAINS THE 
ORIGIN OF THE REPRESENTATIVES OF THE HIGHER TAXONOMIC 
CATEGORIES—THAT IS, THE NEW ANIMALS REPRESENTING NEW 
PHYLA AND CLASSES. NOR DOES IT EXPLAIN THE STRUCTURAL 
AND MORPHOLOGICAL FEATURES THAT DISTINGUISH ANIMALS 
FROM ONE ANOTHER AND EARLIER FORMS OF LIFE. ALLOPATRIC 
SPECIATION EXPLAINS HOW POPULATIONS GET SEPARATED FROM 
EACH OTHER TO FORM DIFFERENT SPECIES. SPECIES SELECTION 
DESCRIBES HOW MORE FIT SPECIES PREDOMINATE OVER OTHER 
SPECIES IN A COMPETITION FOR SURVIVAL. NEITHER MECHANISM 
GIVES ANY ACCOUNT OF HOW THE ANIMALS REPRESENTING THE 
SPECIFICALLY HIGHER TAXA OR THEIR DISTINCTIVE ANATOMICAL 
NOVELTIES AROSE. NEITHER MECHANISM ACCOUNTS, FOR 
EXAMPLE, FOR THE ORIGIN OF THE COMPOUND EYE OF A 
TRILOBITE, NOR THE GILLS OF A CAMBRIAN FISH,°? NOR THE 
ECHINODERM BODY PLAN. 


Many critics of punctuated equilibrium have noted this problem. As Richard 
Dawkins wrote in 1986: “What I mainly want a theory of evolution to do is 
explain complex, well-designed mechanisms like hearts, hands, eyes and 
echolocation. Nobody, not even the most ardent species selectionist, thinks that 
species selection can do this.”*! Or as paleontologist Jeffrey Levinton argued in 
1988, “It is inconceivable how selection among species can produce the 
evolution of detailed morphological structures. . . . Species selection did not 
form an eye.”°? 

So where do these intricate structures come from? Again, when pressed, 
Gould resorted to the alleged power of the neo-Darwinian mechanism. As he 
wrote in his magisterial tome The Structure of Evolutionary Theory, published in 


2002, the year of his death: “I do not deny either the wonder, or the powerful 
importance, of organized adaptive complexity.” He went on to concede, “I 
recognize that we know no mechanism for the origin of such organismal features 
other than conventional natural selection at the organismic level.”°° 

For this reason, few if any evolutionary biologists now regard punctuated 
equilibrium as a solution to the problem of the origin of biological form and 
novelty. As the evolutionary biologists Brian Charlesworth, Russell Lande, and 
Montgomery Slatkin have concluded, “genetic mechanisms that have been 
proposed [by proponents of punctuated equilibrium] to explain the abrupt 
appearance and prolonged stasis of many species are conspicuously lacking in 
empirical support.”°4 


BURST OF INTEREST AND GRADUAL DECLINE STILL, IT MAY NOT 
BE ENTIRELY FAIR TO CRITICIZE PUNCTUATED EQUILIBRIUM FOR 
FAILING TO ACCOUNT FOR THE CAMBRIAN EXPLOSION. GOULD, IN 
PARTICULAR, EQUIVOCATED ABOUT WHETHER HE MEANT 
PUNCTUATED EQUILIBRIUM TO SERVE AS A COMPREHENSIVE 
THEORY OF MACROEVOLUTIONARY CHANGE OR JUST AN 
ACCOUNT OF HOW NEW SPECIES EMERGED FROM A POOL OF 
PREEXISTING SPECIES. STRICTLY SPEAKING, THE MECHANISMS OF 
ALLOPATRIC SPECIATION AND SPECIES SELECTION SOUGHT TO 
EXPLAIN THE PATTERN OF STASIS AND DISCONTINUITY AMONG 
DIFFERENT SPECIES AND NOT AMONG THE HIGHER TAXA. THUS, 
NEAR THE END OF HIS CAREER, GOULD COMPLAINED ABOUT HIS 
CRITICS “MISUNDERSTANDING” HIS THEORY BY ASSERTING THAT 
HE “PROCLAIMED THE TOTAL OVERTHROW OF DARWINISM” AND 
“INTENDED PUNCTUATED EQUILIBRIUM AS BOTH THE AGENT OF 
DESTRUCTION AND THE REPLACEMENT.””? 


Yet Gould and Eldredge, at least initially, advanced punctuated equilibrium as 
a bold new theory of evolutionary biology, giving the impression that it provided 
an ambitious solution to the problem of macroevolution and, by implication, 
events such as the Cambrian explosion. From 1972 to 1980, Eldredge and Gould 
presented a series of provocative scientific papers that portrayed punctuated 
equilibrium as a bold, and even revolutionary, alternative theory of 
macroevolution. Indeed, Gould himself referred to it explicitly as “a speciational 


theory of macroevolution.”°° 


In their second main paper, published in 1977, Gould and Eldredge made 
explicit their intention to position their theory as a “radical”°’ challenge to neo- 
Darwinian gradualism and to replace it with a completely different 
understanding of the mode and mechanism of evolutionary change. Sepkoski 
notes that in this 1977 article “the authors were more explicit about the exact 
nature of the conceptual reconfiguration their theory brought to 
macroevolution.”*° In particular, he argues that Gould and Eldredge “extended 
their model to propose a new and ‘general philosophy of change’ in the natural 
world.”°9 Gould was no less radical in a widely cited 1980 paper in the journal 
Paleobiology in which he offered punctuated equilibrium as “a new and general 
theory” of evolution. There he also famously declared the synthetic theory of 
neo-Darwinism “effectively dead, despite its persistence as _ textbook 
orthodoxy.”4? 

Only after critics exposed punctuated equilibrium for lacking an adequate 
mechanism did Gould retreat to a more conservative formulation of the theory, 
making its reliance upon the neo-Darwinian mechanism explicit. From the early 
1980s until his death in 2002, Gould made a series of concessions in particular 
about the inadequacy of speciation and species selection as mechanisms for 
generating complex adaptations. Thus, as Sepkoski notes, “Despite the brashness 
of many of his claims on behalf of punctuated equilibrium over the years, one is 
brought time and again back to the reconciliatory, even conservative 
justifications Gould made for his theory,” particularly, he notes, in The Structure 
of Evolutionary Theory, written in the years just before Gould’s death.*! 

In the end, Gould’s concessions to neo-Darwinism brought his thinking back 
into conflict with the pattern of sudden appearance in the fossil record that the 
theory of punctuated equilibrium was designed to explain. If Gould and Eldredge 
were right about the abrupt appearance of new forms of life in the fossil record, 
and if the neo-Darwinian mechanism needs as much time as evolutionary 
biologists and population geneticists (see Chapters 8 and 12) calculate, then the 
mutation and selection mechanism does not have enough time to produce the 
new traits needed to build the forms of life that first appear in the Cambrian 
period. But punctuated equilibrium, as initially formulated to rely mainly on 
allopatric speciation and species selection, fared no better, since neither 
mechanism gives any explanation for the origin of new traits. And so, in the end, 
punctuated equilibrium highlighted rather than resolved a profound dilemma for 
evolutionary theory: neo-Darwinism allegedly has a mechanism capable of 
producing new genetic traits, but it appears to produce them too slowly to 


account for the abrupt appearance of new form in the fossil record; punctuated 
equilibrium attempts to address the pattern in the fossil record, but fails to 
provide a mechanism that can produce new traits whether abruptly or otherwise. 
No wonder, then, that leading Cambrian paleontologists such as James Valentine 
and Douglas Erwin concluded in 1987, that “neither of the contending theories 
of evolutionary change at the species level, phyletic gradualism or punctuated 
equilibrium, seem applicable to [explaining] the origin of new body plans.” 


SPINNING IN CIRCLES IN A SUDDEN FLASH OF INSIGHT IN THE 
HUMDRUM OF A LAUNDROMAT, NILES ELDREDGE REALIZED THAT 
STASIS IN THE FOSSIL RECORD REPRESENTED EVIDENCE RATHER 
THAN MERE INVESTIGATIVE FAILURE. BUT LIKE LAUNDRY 
SPINNING AROUND IN A WASHING MACHINE, THE THEORY OF 
PUNCTUATED EQUILIBRIUM ITSELF BECAME CAUGHT IN A DREARY 
CYCLE OF CONTRADICTION. ON THE ONE HAND, “PUNK EEK” MADE 
A BOLD ATTEMPT TO DESCRIBE MORE ACCURATELY, AND EVEN 
EXPLAIN, THE DECIDEDLY DISCONTINUOUS PATTERN OF THE 
FOSSIL RECORD. ON THE OTHER, ITS ADVOCATES WERE FORCED TO 
CONCEDE BOTH THE INADEQUACY OF THEIR PROPOSED 
MECHANISMS AND THEIR NEED TO RELY UPON THE NEO- 
DARWINIAN PROCESS OF MUTATION AND SELECTION IN ORDER TO 
ACCOUNT FOR THE ORIGIN OF NEW GENETIC TRAITS AND 
ANATOMICAL INNOVATIONS. AFTER GOULD APPEARED TO 
JETTISON BOTH GRADUALISM AND A RELIANCE ON THE NEO- 
DARWINIAN MECHANISM IN ORDER TO BRING EVOLUTIONARY 
THEORY INTO CONFORMITY WITH THE FOSSIL RECORD, HE 
EVENTUALLY ACKNOWLEDGED THAT HE COULD NOT EXPLAIN THE 
ORIGIN OF THE FORMS OF LIFE DOCUMENTED IN THE FOSSIL 
RECORD APART FROM THAT SAME SLOW AND GRADUALLY ACTING 
MECHANISM. THUS, THOUGH THE THEORY OF PUNCTUATED 
EQUILIBRIUM WAS INITIALLY PRESENTED AS A SOLUTION TO THE 
MYSTERIOUS AND SUDDEN ORIGIN OF ANIMAL FORMS, UPON 
CLOSER INSPECTION, IT FAILED TO OFFER SUCH A SOLUTION. 


Nevertheless, the failure of punctuated equilibrium to provide a sufficient 
mechanism has raised questions about the adequacy of the mechanism that 
Gould ultimately did reaffirm as the explanation for the origin of novel 


biological form. Can the neo-Darwinian mechanism of natural selection acting 
on random mutations build new forms of animal life with all their complex 
adaptations? If so, is it possible that it could do so in the brief time allowed by 
the fossil record? If not, is it reasonable to think that it could build new forms of 
animal life if only more time were available? If so, how much time would the 
Darwinian mechanism need to build complex adaptations and new forms of 
animal life? In the next several chapters, I will address these fundamental 
questions at the heart of the Cambrian mystery—dquestions, in brief, about how 
to build an animal. 


Part Two 


How to Build an Animal 
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The Cambrian Information Explosion 


When I was a college professor, I used to ask my students a question: “If you 
want your computer to acquire a new function or capability, what do you have to 
give it?” Typically, I would hear a smattering of similar answers from the class: 
“code,” “instructions,” “software,” “information.” Of course, all these are 
correct. And thanks to discoveries in modern biology, we now know that 
something similar is true of life: to build a new form of life from a simpler 
preexisting form requires new information. 

To this point I’ve examined one main aspect of the mystery surrounding the 
Cambrian explosion: the mystery of the missing Precambrian ancestral forms 
expected on the basis of Darwin’s theory. The next group of chapters will 
examine a second, and perhaps more profound, aspect of the Cambrian mystery: 
that of the cause of the Cambrian explosion. By what means or process or 
mechanism could something as complex as a trilobite have arisen? Could natural 
selection have accomplished such a feat? To answer this question we will have to 
look more closely at what it takes to build a new form of animal life. And we’Il 
see that an important part of the answer to that question will have something to 
do with the concept of information. 


” ce 


THE DARWINIAN ACCOUNT OF THE ORIGIN OF ANIMAL FORM 


As Darwin envisioned the process, natural selection can accomplish nothing 
without a steady supply of variation as a source of new biological traits, forms, 
and structures. Only after useful new variations arise can natural selection sift 
them from the chaff of unhelpful variations. If, however, the amount of variation 
available to natural selection is limited, then natural selection will encounter 
limits on how much new biological form and structure it can build. 

Even in the late nineteenth century, many leading scientists recognized this. 
For this reason, there has been a long history of scientific controversy about just 


how much novelty natural selection can produce and about whether natural 
selection is a truly creative process. In fact, between 1870 and 1920 classical 
Darwinism entered a period of eclipse, because many scientists thought that it 
could not explain the origin and transmission of new heritable variation.! 

Darwin favored a theory of blended inheritance that seemed to imply 
limitations on the amount of genetic variability.* He thought that when parents 
with different traits combined germ cells during sexual reproduction, the 
resulting offspring would receive not one or the other set of their differing traits, 
but instead a compromise version. For example, if a male bird with red feathers 
on its wings mated with a female bird of the same species with white wing 
feathers, the theory implied that the two would likely produce offspring with 
pink wing feathers. As many of Darwin’s contemporaries pointed out, such 
instances of blending inheritance involved strict limitations on the range of traits 
that could possibly arise, depriving natural selection of the wide-ranging supply 
of variation it would need to produce truly fundamental changes in the form of 
animals. The pink-feathered offspring might later reproduce with a white-or red- 
feathered bird of the same species, producing a slightly lighter or darker shade of 
pink feathers. Nevertheless, descendants of the original white-and red-feathered 
pair would never produce green, blue, or yellow feathers in subsequent 
generations. If correct, blending inheritance would eventually lead to a bland, 
homogenous, variationless state in a population. 

In the 1860s, the Austrian monk Gregor Mendel, widely regarded as the 
founder of modern genetics, showed in his work on garden peas that Darwin’s 
assumptions about blending inheritance were incorrect. The results of his studies 
created, at least initially, more problems for Darwinism. Mendel showed that the 
genetic traits of organisms typically have an integrity that resists blending. He 
showed this by cross-pollinating plants with yellow peas and plants with green 
peas. The plants in the subsequent generations produced either yellow or green 
peas, but nothing in between and nothing with an altogether different color. 

He also showed that the plants carried some kind of signal or instructions for 
building different traits even when the trait was not on display in a particular 
plant. He noticed, for example, that when he crossed pea plants with green and 
yellow seeds, the next generation had only yellow seeds, almost as if the 
capacity for generating green peas had been lost. But when he cross-pollinated 
the second-generation plants, the ones with only yellow peas, he found that both 
yellow and green peas emerged in the third generation, in a ratio of 3 to 1. From 
this Mendel hypothesized that the second-generation plants continued to carry 


signals, which he called “factors,” and later scientists called genes, for 
generating green peas even when those plants themselves did not display that 
trait. 

The classical Mendelian genetics that replaced Darwin’s blending theory of 
inheritance also suggested limitations on the amount of genetic variability 
available to natural selection. If plant reproduction produced either green or 
yellow peas but never some intermediate form, and if the signals for producing 
the green traits and yellow traits persisted unchanged from generation to 
generation, it was difficult to see how sexual reproduction and genetic 
recombination could produce anything more than unique combinations of 
already existing traits. 

In the decades immediately after Mendel’s work, geneticists came to 
understand genes as discrete units or packets of hereditary information that could 
be independently sorted and shuffled within the chromosome. This too suggested 
that a significant, but still strictly limited, amount of genetic variation could arise 
by genetic recombination during sexual reproduction. Thus, Mendelian genetics 
raised significant questions about whether the process of natural selection has 
access to enough variation (which, after Mendel, was conceived as genetic 
variation) to allow it to produce any significant morphological novelty. For a 
time, Darwin’s theory was in retreat. 


DARWINISM MUTATES 


During the 1920s, 1930s, and 1940s, however, developments in genetics revived 
natural selection as the main engine of evolutionary change. Experiments 
performed by Hermann Muller in 1927 showed that X-rays could alter the 
genetic composition of fruit flies, resulting in unusual variations.* Muller called 
these X-ray-induced changes “mutations.” Other scientists soon reported that 
they had produced mutations in the genes of other organisms, including humans. 
Whatever genes were made of—and biologists then still did not know—these 
developments suggested they could vary more than either Darwin or classical 
Mendelian genetics had assumed. Geneticists at the time also discovered that 
these small-scale changes in genes were potentially heritable.? But if variant 
versions of genes were heritable, then presumably natural selection could favor 
advantageous gene variants and eliminate the others. These mutations could then 
influence the future direction of evolution and, in theory at least, provide an 
unlimited supply of variation for natural selection’s workshop. 

The discovery of genetic mutations also suggested a way to reconcile 
Darwinian theory with insights from Mendelian genetics. During the 1930s and 
1940s, a group of evolutionary biologists, including Sewall Wright, Ernst Mayr, 
Theodosius Dobzhansky, J. B. S. Haldane, and George Gaylord Simpson, 
attempted to demonstrate this possibility using mathematical models to show 
that small-scale variations and mutations could accumulate over time in whole 
populations, eventually producing large-scale morphological change.° These 
mathematical models formed the basis of a subdiscipline of genetics known as 
population genetics. The overall synthesis of Mendelian genetics with Darwinian 
theory came to be called “neo-Darwinism” or simply the “New Synthesis.” 

According to this new synthetic theory, the mechanism of natural selection 
acting upon genetic mutations suffices to account for the origin of novel 
biological forms. Small-scale “microevolutionary” changes can accumulate to 
produce large-scale “macroevolutionary” innovations. The neo-Darwinists 
argued that they had revived natural selection by discovering a specific 
mechanism of variation that could generate new forms of life from simpler 
preexisting ones. By the centennial celebration of Darwin’s Origin of Species in 
1959, it was widely assumed that natural selection and random mutations could 
indeed build new forms of life over the course of time with their distinctive body 
plans and novel anatomical structures. At the celebration, Julian Huxley, the 


grandson of T. H. Huxley, summarized this optimism in a grand proclamation: 


Future historians will perhaps take this Centennial Week as epitomizing 
an important critical period in the history of this earth of ours—the period 
when the process of evolution, in the person of inquiring man, began to 
truly be conscious of itself... . This is one of the first public occasions on 
which it has been frankly faced that all aspects of reality are subject to 
evolution, from atoms and stars to fish and flowers, from fish and flowers 
to human societies and values—indeed, that all reality is a single process 
of evolution . . .” 


In a television broadcast leading up to the Centennial celebration, Huxley 
captured the optimistic mood more succinctly: “Darwinism has come of age, so 
to speak. We are no longer having to bother about establishing the fact of 
evolution.”® 


VARIATION AS INFORMATION 


Initially, the elucidation of the structure of DNA by James Watson and Francis 
Crick in 1953 contributed to this euphoria.” Indeed, it seemed to lift the 
mechanism of genetic variation and mutation out of the mist and into the clear 
light of the emerging science of molecular biology. Watson and Crick’s 
elucidation of the double helix structure of DNA suggested that DNA stored 
genetic information in the form of a four-character digital and chemical code 
(see Fig. 8.1). Later, following the formulation of Francis Crick’s famed 
“sequence hypothesis,” molecular biologists confirmed that the chemical 
subunits along the spine of the DNA molecule called nucleotide bases function 
just like alphabetic characters in a written language or digital characters in a 
machine code. Biologists established that the precise arrangement of these 
nucleotide bases conveyed instructions for building proteins.!° (See Fig. 8.2.) 
Molecular biologists also determined that this store of genetic information in 
DNA is transmitted from one generation of cells and organisms to another. In 
short, it was established that DNA stores hereditary information for building 
proteins and thus, presumably, for building higher-order anatomical traits and 
structures as well. 

The elucidation of the double helix seemed to resolve some long-standing 
issues in evolutionary biology. Darwinists had long maintained that natural 
selection produced new forms by separating the proverbial wheat from the chaff 
of genetic variation, but they didn’t know where the raw material for all of the 
competing variations resided. Neither did they know how genes stored 
information for producing the traits associated with them. Further, even after 
geneticists discovered that stable genetic traits could be altered by mutations, 
they remained uncertain about what exactly it was that was being “mutated.” 
Consequently, biologists were uncertain about exactly where variations and 
mutations occurred. 








FIGURE. 8.1 

James Watson (left) and Francis Crick (right) presenting their model of the 
structure of the DNA molecule in 1953. Courtesy A. Barrington Brown/Science 
Source. 


Watson and Crick’s model suggested an answer to that question: genes 
correspond to long sequences of bases on a strand of DNA. Building on that 
insight, evolutionary biologists proposed that new variations arose, first, from 
the genetic recombination of different sections of DNA (different genes) during 
sexual reproduction and, second, from a special kind of variation called 
mutations that occur from random changes in the arrangement of nucleotide 
bases in DNA. Just as a few typographical errors in an English sentence might 
alter the meaning of a few words or even the whole sentence, so too might a 
change in the sequential arrangement of the bases in the genetic “text” in DNA 
produce new proteins or morphological traits. 
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FIGURE 8.2 


The model (or structural formula) of the DNA molecule showing the digital or 
alphabetic character of the nucleotide bases stored along the sugar-phosphate 
backbone of the molecule. 


Watson and Crick’s discovery also raised new questions—in particular, 
questions about the information necessary to build completely new forms of life 
during the course of biological evolution. True, mutations play a role in this 
process, but could they generate enough information to produce novel forms of 
animal life such as those that arose in the Cambrian period, an explosion—a vast 
proliferation—of new biological information? 


THE CAMBRIAN INFORMATION EXPLOSION 


Consider choanoflagellates, a group of single-celled eukaryotic organisms with a 
flagellum. What separates such organisms from a trilobite or a mollusk or even a 
lowly sponge? Clearly, all three of these higher forms of life are more complex 
than any one-celled organism. But exactly how much more complex? 

James Valentine has noted that one useful way of comparing degrees of 
complexity is to assess the number of cell types in different organisms (see Fig. 
8.3)./' Though a single-celled eukaryote has many specialized internal structures 
such as a nucleus and various organelles, it still, obviously, represents just a 
single type of cell. Functionally more complex animals require more cell types 
to perform their more diverse functions. Arthropods and mollusks, for example, 
have dozens of specific tissues and organs, each of which requires “functionally 
dedicated,” or specialized, cell types. 

These new cell types, in turn, require many new and specialized proteins. An 
epithelial cell lining a gut or intestine, for example, secretes a specific digestive 
enzyme. This enzyme requires structural proteins to modify its shape and 
regulatory enzymes to control the secretion of the digestive enzyme itself. Thus, 
building novel cell types typically requires building novel proteins, which 
requires assembly instructions for building proteins—that is, genetic 
information. Thus, an increase in the number of cell types implies an increase in 
the amount of genetic information. 
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FIGURE 8.3 











Biological complexity scale as measured in number of cell types of different 
organisms. 


Applying this insight to ancient life-forms underscores just how dramatic the 
Cambrian explosion was. For over 3 billion years, the living world included little 
more than one-celled organisms such as bacteria and algae.'* Then, beginning in 
the late Ediacaran period (about 555-570 million years ago), the first complex 
multicellular organisms appeared in the rock strata, including sponges and the 
peculiar Ediacaran biota discussed in Chapter 4.'° This represented a large 
increase in complexity. Studies of modern animals suggest that the sponges that 
appeared in the late Precambrian, for example, probably required about ten cell 
types. /4 

Then 40 million years later, the Cambrian explosion occurred.!° Suddenly the 
oceans swarmed with animals such as trilobites and anomalocaridids that 
probably required fifty or more cell types—an even greater jump in complexity. 
Moreover, as Valentine notes, measuring complexity differences by measuring 
differences in the number of cell types probably “greatly underestimate[s] the 
complexity differentials between bodyplans.”!® 

One way to estimate the amount of new genetic information that appeared 
with the Cambrian animals is to measure the size of the genomes of modern 
representatives of the Cambrian groups and compare that to the amount of 
information in simpler forms of life. Molecular biologists have estimated that a 
minimally complex single-celled organism would require between 318,000 and 
562,000 base pairs of DNA to produce the proteins necessary to maintain life.‘7 
More complex single cells might require upwards of a million base pairs of 
DNA. Yet to assemble the proteins necessary to sustain a complex arthropod 
such as a trilobite would need orders of magnitude more protein-coding 
instructions. By way of comparison, the genome size of a modern arthropod, the 
fruit fly Drosophila melanogaster, is approximately 140 million base pairs.‘® 
Thus, transitions from a single cell to colonies of cells to complex animals 
represent significant—and in principle measurable—increases in genetic 
information. 

During the Cambrian period a veritable carnival of novel biological forms 
arose. But because new biological form requires new cell types, proteins, and 
genetic information, the Cambrian explosion of animal life also generated an 
explosion of genetic information unparalleled in the previous history of life.'° 
(In Chapter 14, we’ll see that building a new animal body plan also requires 


another type of information, not stored in genes, called epigenetic information.) 

So can the neo-Darwinian mechanism explain the dramatic increase in 
genetic information that appears in the Cambrian explosion? Before addressing 
that question, it will help to define the concept of information and identify the 
kind of information that DNA contains. 


BIOLOGICAL INFORMATION: SHANNON OR OTHERWISE? 


Scientists typically recognize at least two basic types of information, functional 
(or meaningful) information and so-called Shannon information, which is not 
necessarily meaningful or functional. The distinction has arisen in part because 
of developments in a branch of applied mathematics known as information 
theory. During the late 1940s, mathematician Claude Shannon, working at the 
Bell Laboratories, developed a mathematical theory of information. Shannon 
equated the amount of information transmitted by a sequence of symbols or 
characters with the amount of uncertainty reduced or eliminated by the 
transmission of that sequence.7” 

Shannon thought that an event or communication that didn’t eliminate much 
uncertainty was also not very informative. Consider an illustration. In the 1970s, 
when I was a teenager, if someone made a completely obvious statement, we 
would say, “Tell me something else I didn’t know.” Imagine that one of my 
classmates on the baseball team has just rushed up to breathlessly “inform” me 
that our team’s star pitcher is planning to throw the ball to the catcher in the next 
game. Such a statement would earn the scornful reply, “Tell me something else I 
didn’t know.” 

The obvious statement about the pitcher’s intentions also illustrates why 
Shannon equated the elimination of uncertainty with the transmission of 
information. Since star pitchers who want to go on being star pitchers have no 
choice but to throw the ball across the plate to the catcher, the statement of my 
overwrought friend eliminated no previous uncertainty. It was not informative in 
the least. If, on the other hand, after days of speculation on campus about which 
one of our team’s four pitchers the baseball coach would choose to pitch in the 
championship game, my friend had rushed up to me and revealed the identity of 
the starting pitcher, that would be different. In that case, he would have 
eliminated some significant uncertainty on my part with a decidedly informative 
Statement. 

Shannon’s theory quantified the intuitive connection between reduced 


uncertainty and information by asserting that the more uncertainty an event or 
communication eliminated, the more information it conveyed. Imagine that after 
revealing to me the identity of the starting pitcher before the championship 
baseball game in the spring, he then also revealed to me the identity of the 
starting quarterback before the upcoming football season. Imagine as well that 
our baseball team had four equally competent pitchers and the football team had 
just two equally competent quarterbacks. Given these facts, my friend’s decision 
to inform me of the identity of the starting pitcher eliminated more uncertainty 
than his decision to reveal to me the identity of the starting quarterback. 

To assign precise quantitative measures of information, Shannon further 
linked the reduction of uncertainty and information to quantitative measures of 
probability (or improbability). Notice that in my _ illustration, the more 
informative communication reduced more uncertainty and also described a more 
improbable event. The probability of any one of the four pitchers being selected 
was 1 in 4. The probability of one of the quarterbacks being selected, given the 
same assumption, was only 1 in 2. The more improbable event eliminated more 
possibilities and more uncertainty. Thus, it conveyed more information. 

Shannon applied these intuitions to quantify the amount of information 
present in sequences of symbols or characters stored in texts or codes or 
transmitted across communications channels. Thus, in his theory the presence of 
an English letter in a sequence of other such letters transmits more information 
than a single binary digit (a zero or one) in a section of computer code. Why? 
Again, the letter in the English alphabet reduces uncertainty among twenty-six 
possibilities, whereas a single binary digit reduces uncertainty among only two. 
The probability of any one character from the English alphabet occurring in a 
sequence of other such letters (disregarding the need for spaces and punctuation) 
is 1 in 26. The probability of either a zero or one arising in a sequence of binary 
characters is 1 in 2. In Shannon’s theory the presence of the more improbable 
character conveys more information. 

Yet even a binary alphabet can convey an unlimited amount of information 
because, in Shannon’s theory, additional information is conveyed as 
improbabilities multiply. Imagine a grab bag of tiles with either zero or one 
etched onto each. Imagine someone producing a series of zeros and ones by 
reaching into the bag and placing them one by one on a game board. The 
probability of choosing a zero on the first pick is just 1 in 2. But the probability 
of choosing two consecutive zeros after placing the first back in the grab bag 
(and shaking the tiles) is 1 chance in 2 x 2, or 1 chance in 4. This is because 


there are four possible combinations of digits that could have been chosen—00, 
01, 10, or 11. Similarly, the probability of producing any three-letter sequence as 
a result of choosing consecutively in this manner is 1 in 2 x 2 x 2, or 1 in 2° (or 
1 in 8). The improbability of any specific sequence of characters increases 
exponentially with the number of characters in the sequence. Thus, longer and 
longer sequences can generate larger and larger amounts of information even 
using a simple binary alphabet. 

Information scientists measure such informational increases through a unit 
they call a bit. A bit represents the minimum amount of information that can be 
conveyed (or uncertainty reduced) by a single digit in a two-character 
alphabet.7! 

Biologists can readily apply Shannon’s information theory to measure the 
amount of Shannon information in a sequence of DNA bases (or the sequence of 
amino acids in a protein) by assessing the probability of the sequence occurring 
and then converting that probability to an information measure in bits.** DNA 
conveys information, in Shannon’s sense, in virtue of its containing long 
improbable arrangements of four chemicals—the four bases that fascinated 
Watson and Crick—adenine, thymine, guanine, and cytosine (A, T, G, and C). 
As Crick realized in formulating his sequence hypothesis, these nucleotide bases 
function as alphabetic or digital characters in a linear array. Since each of the 
four bases has an equal 1 in 4 chance of occurring at each site along the spine of 
the DNA molecule, biologists can calculate the probability, and thus the Shannon 
information, or what is technically known as the “information-carrying 
capacity,” of any particular sequence n bases long. For instance, any particular 
sequence three bases long has a probability of 1 chance in 4 x 4 x 4, or 1 chance 
in 64, of occurring—which corresponds to 6 bits of Shannon information. 
(Indeed, each base in a DNA sequence conveys 2 bits of information, since 1 in 
4 is equal to 1 chance in 2 x 2.) 

Yet the applicability of Shannon information theory to molecular biology has, 
to some degree, obscured a key distinction concerning the type of information 
that DNA possesses. Although Shannon’s theory measures the amount of 
information in a sequence of symbols or characters (or chemicals functioning as 
such), it doesn’t distinguish a meaningful or functional sequence from useless 
gibberish. For example: 


“we hold these truths to be self-evident” 
“ntnyhiznslhtgeqkahgdsjnfplknejmsed” 


These two sequences are equally long and equally improbable if we imagine 
them being drawn at random. Thus, they contain the same amount of Shannon 
information. Yet clearly there is an important qualitative distinction between 
them that the Shannon measurement does not capture. The first meaningful 
sequence performs a communication function, while the second does not. 

Shannon emphasized that the kind of information his theory described needs 
to be carefully distinguished from our common notions of information. As 
Warren Weaver, one of Shannon’s close collaborators, made clear in 1949, “The 
word ‘information’ in this theory is used in a special mathematical sense that 
must not be confused with its ordinary usage.”~° By ordinary usage, Weaver, of 
course, was referring to the idea of meaningful or functional communication. 

Webster’s dictionary defines information as “the communication or reception 
of knowledge or intelligence.” It also defines information as “the attribute 
inherent in, and communicated by, alternative sequences or arrangements of 
something that produce a specific effect.” A sequence of characters possessing a 
large amount of Shannon information may convey meaning (as in an English 
text) or perform a function that “produces a specific effect” (as do both English 
sentences and computer codes, for example) or it may not (as would be the case 
with a meaningless pile of letters or a screen of scrambled computer code). In 
any case, Shannon’s purely mathematical theory of information does not 
distinguish the presence of meaningful or functional sequences from merely 
improbable, though meaningless ones. It only provides a mathematical measure 
of the improbability—or information-carrying capacity—of a sequence of 
characters. In a sense, it provides a measure of a sequence’s capacity to carry 
functional or meaningful information. It does not, and cannot, determine whether 
the sequence in question does convey meaning or generate a functionally 
significant effect. 

Strands of DNA _ contain information-carrying capacity—something 
Shannon’s theory can measure.** But DNA, like natural languages and computer 
codes, also contains functional information.*° 

In languages such as English, specifically arranged characters convey 
functional information to conscious agents. In computer or machine code, 
specifically arranged characters (zeros and ones) produce functionally significant 
outcomes within a computational environment without a conscious agent 
receiving the meaning of the code inside the machine. In the same way, DNA 
stores and conveys functional information for building proteins or RNA 
molecules, even if it is not received by a conscious agent. As in computer code, 


the precise arrangement of characters (or chemicals functioning as characters) 
allows the sequence to “produce a specific effect.” For this reason, I also like to 
use the term specified information as a synonym for functional information, 
because the function of a sequence of characters depends upon the specific 
arrangement of those characters. 

And DNA contains specified information, not just Shannon information or 
information-carrying capacity. As Crick himself put it in 1958, “By information I 
mean the specification of the amino acid sequence in protein. . . . Information 
means here the precise determination of sequence, either of bases in the nucleic 
acid or on amino acid residues of the protein.”*° 


THE MESSAGE AS THE MYSTERY 


So if the origin of the Cambrian animals required vast amounts of new 
functional or specified information, what produced this information explosion? 
Since the molecular biological revolution first highlighted the primacy of 
information to the maintenance and function of living systems, questions about 
the origin of information have moved decidedly to the forefront of discussions 
about evolutionary theory. What’s more, the realization that specificity of 
arrangement, rather than mere improbability, characterizes the genetic text has 
raised some challenging questions about the adequacy of the neo-Darwinian 
mechanism. Is it plausible to think that natural selection working on random 
mutations in DNA could produce the highly specific arrangements of bases 
necessary to generate the protein building blocks of new cell types and novel 
forms of life? Perhaps nowhere do such questions pose more of a challenge to 
neo-Darwinian theory than in discussions of the Cambrian explosion. 
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Combinatorial Inflation 


Murray Eden (see Fig. 9.1), a professor of engineering and computer science at 
MIT, was accustomed to thinking about how to build things. But when he began 
to consider the importance of information to building living organisms, he 
realized something didn’t add up. His critics said that he knew just enough 
biology to be dangerous. In retrospect, they were probably right. 

In the early 1960s, just as molecular biologists had confirmed Francis Crick’s 
famed sequence hypothesis, Eden began to think about the challenge of building 
a living organism. Of course, Eden wasn’t contemplating building a living 
organism himself. Rather, he was thinking about what it would take for the neo- 
Darwinian mechanism of natural selection acting on random mutations to do the 
job. He wondered whether mutation and selection could generate the needed 
functional information. 

To his way of thinking, specificity was a big part of the problem. Obviously, 
if DNA contained an improbable sequence of nucleotide bases in which the 
arrangement of bases does not matter to the function of the molecule, then 
random mutational changes in the sequence of bases would not have a 
detrimental effect on the function of the molecule. But, of course, sequence does 
affect function. Eden knew that in all computer codes or written text in which the 
specificity of sequence determines function, random changes in sequence 
consistently degrade function or meaning. As he explained, “No currently 
existing formal language can tolerate random changes in the symbol sequences 
which express its sentences. Meaning is almost invariably destroyed.”! Thus, he 
suspected that the need for specificity in the arrangement of DNA bases made it 
extremely improbable that random mutations would generate new functional 
genes or proteins as opposed to degrading existing ones. 

But how improbable? How difficult would it be for random mutations to 
generate, or stumble upon, the genetically meaningful or functional sequences 


needed to supply natural selection with the raw material—the genetic 
information and variation—it needed to produce new proteins, organs, and forms 
of life? Eden wasn’t the only mathematician or scientist asking these questions. 
But the mathematically based challenge to evolutionary theory that he helped to 
initiate would indeed prove dangerous to neo-Darwinian orthodoxy. 


THE WISTAR INSTITUTE CONFERENCE 


During the early 1960s, Eden began discussing the plausibility of the neo- 
Darwinian theory of evolution with several MIT colleagues in math, physics, and 
computer science. As the discussion grew to include mathematicians and 
scientists from other institutions, the idea of a conference was born. In 1966, a 
distinguished group of mathematicians, engineers, and scientists convened a 
conference at the Wistar Institute in Philadelphia called “Mathematical 
Challenges to the Neo-Darwinian Interpretation of Evolution.” Prominent among 
the attendees were Marcel-Paul Schiitzenberger, a mathematician and physician 
at the University of Paris; Stanislaw Ulam, the codesigner of the hydrogen 
bomb; and Eden himself. The conference also included a number of prominent 
biologists, including Ernst Mayr, an architect of modern neo-Darwinism, and 
Richard Lewontin, at the time a professor of genetics and evolutionary biology 
at the University of Chicago. 





FIGURE 9.1 
Murray Eden. Courtesy MIT Museum. 


Sir Peter Medawar, a Nobel laureate and the director of the North London 
Medical Research Council’s laboratories, chaired the meeting. In his opening 


remarks, he said, “The immediate cause of this conference is a pretty widespread 
sense of dissatisfaction about what has come to be thought of as the accepted 
evolutionary theory in the English-speaking world, the so-called neo-Darwinian 
theory.” 

For many, doubts about the creative power of the mutation and selection 
mechanism stemmed from the elucidation of the nature of genetic information 
by molecular biologists in the late 1950s and early 1960s. 

The discovery that the genetic information in DNA is stored as a linear array 
of precisely sequenced nucleotide bases at first helped to clarify the nature of 
many mutational processes. Just as a sequence of letters in an English text might 
be altered either by changing individual letters one by one or by combining and 
recombining whole sections of text, so too might the genetic text be altered 
either one base at a time or by combining and recombining different sections of 
genes in various ways at random. Indeed, modern genetics has established 
various mechanisms of mutational change—not only “point mutations,” or 
changes in individual bases, but also duplications, insertions, inversions, 
recombinations, and deletions of whole sections of the genetic text. 

Although fully aware of this range of mutational options at nature’s disposal, 
Eden argued at Wistar that such random changes to written texts or sections of 
digital code would inevitably degrade the function of information-bearing 
sequences, particularly when allowed to accumulate.’ For example, the simple 
phrase “One if by land and two if by sea” will be significantly degraded by just a 
handful of random changes such as those in bold: “Ine if bg lend and two ik bT 
Nea.” At the conference the French mathematician Marcel Schiitzenberger 
agreed with Eden’s concerns about the effect of random alterations. He noted 
that if someone makes even a few random changes in the arrangement of the 
digital characters in a computer program, “we find that we have no chance (i.e., 
less than 1/10!) even to see what the modified program would compute: it just 
jams.”* Eden argued that much the same problem applied to DNA—that insofar 
as specific arrangements of bases in DNA function like digital code, random 
changes to these arrangements would likely efface their function, while attempts 
to generate completely new sections of genetic text by random means were 
likely doomed to failure.° 

The explanation for this inevitable diminution in function is found in a branch 
of mathematics called combinatorics. Combinatorics studies the number of ways 
a group of things can be combined or arranged. At one level, the subject is fairly 
intuitive. If a thief slips round the corner of a dormitory after hours looking for a 


bike to steal, he will scan the bike rack for an easy target. If he spots a basic 
bicycle-style lock with only three dials of ten numbers each, and on the rack 
beside it one with five dials of ten numbers each, the thief wouldn’t need a 
degree in mathematics to realize which one he should attempt to open. He knows 
that he would need to search fewer total possibilities with the three-dial lock. 

A straightforward calculation supports his intuition. The simpler lock has 
only 10 x 10 x 10, or 1000, possible combinations of digits—or what 
mathematicians refer to as “combinatorial” possibilities. The five-dial lock has 
10 x 10 x 10 x 10 x 10, or 100,000, combinatorial possibilities. With a lot of 
patience, the thief might elect to systematically work his way through the 
different combinations of digits on the simpler lock, knowing that at some point 
he will stumble across the correct combination. He shouldn’t even bother with 
the five-dial lock, since making his way through all of the possible combinations 
on it would take 100 times as long. The five-dial lock simply has too many 
possibilities for the thief to have a reasonable chance of opening it by trial and 
error in the time available to him. 

Several of the Wistar scientists noted that the mutation and selection 
mechanism faces a similar problem. Neo-Darwinism envisions new genetic 
information arising from random mutations in the DNA. If at any time from birth 
to reproduction the right mutation or combination of mutations accumulate in the 
DNA of cells involved in reproduction (whether sexual or asexual), then 
information for building a new protein or proteins will pass on to the next 
generation. When that new protein happens to confer a survival advantage on an 
organism, the genetic change responsible for the new protein will tend to be 
passed on to subsequent generations. As favorable mutations accumulate, the 
features of a population will gradually change over time. 

Clearly, natural selection plays a crucial role in this process. Favorable 
mutations are passed on; unfavorable mutations are weeded out. Nevertheless, 
the process can only select variations in the genetic text that mutations have first 
produced. For this reason, evolutionary biologists typically recognize that 
mutation, not natural selection, provides the source of variation and innovation 
in the evolutionary process. As evolutionary biologists Jack King and Thomas 
Jukes put it in 1969, “Natural selection is the editor, rather than the composer, of 
the genetic message.”° 

And that was the problem, as the Wistar skeptics saw it: random mutations 
must do the work of composing new genetic information, yet the sheer number 
of possible nucleotide base or amino-acid combinations (i.e., the size of the 


combinatorial “space”) associated with a single gene or protein of even modest 
length rendered the probability of random assembly prohibitively small. For 
every sequence of amino acids that generates a functional protein, there are a 
myriad of other combinations that don’t. As the length of the required protein 
grows, the number of possible amino-acid combinations mushrooms 
exponentially. As this happens, the probability of ever stumbling by random 
mutation onto a functional sequence rapidly diminishes. 

Consider another illustration. The two letters X and Y can be combined in 
four different two-letter combinations (XX, XY, YX and YY). They can be 
combined in eight different ways for three-letter combinations (XXX, XXY, 
XYY, XYX, YXX, YYX, YXY, YYY), sixteen ways for four-letter 
combinations, and so on. The number of possible combinations grows 
exponentially—2?, 2°, 2*, and so on—as the number of letters in the sequence 
grows. Mathematician David Berlinski calls this the problem of “combinatorial 
inflation,” because the number of possible combinations “inflates” dramatically 
as the number of characters in a sequence grows (see Fig. 9.2). 

The combinations of bases in DNA are subject to combinatorial inflation of 
just this sort. The information-bearing sequences in DNA consist of specific 
arrangements of the four nucleotide bases. Consequently, there are four possible 
bases that could occur at each site along the DNA backbone and 4 ~ 4, or 47, or 
16 possible two-base sequences (AA AT AG AC TA TG TC TT CG CT CC CA 
GA GG GC GT). Similarly, there are 4 x 4 x 4, or 4°, or 64 possible three-base 
sequences. (I’ll refrain from listing them all.) That is, increasing the number of 
bases in a sequence from 1 to 2 to 3 increases the number of possibilities from 4 
to 16 to 64. As the sequence length continues to grow, the number of 
combinatorial possibilities corresponding to sequences of increasing length 
inflates exponentially. For example, there are 4!°°, or 10°°, possible ways of 
arranging one hundred bases in a row. 
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FIGURE 9.2 

The problem of combinatorial inflation as illustrated by bike locks of varying 
sizes. As the number of dials on the bike locks increases, the number of possible 
combinations rises exponentially. 


The amino-acid chains are also subject to such inflation. A chain of two 
amino acids could display 20°, or 20 x 20, or 400 possible combinations, since 
each of the twenty protein-forming amino acids could combine with any one of 
that same group of twenty in the second position of a short peptide chain. With a 
three-amino-acid sequence, we’re looking at 20°, or 8,000, possible sequences. 
With four amino acids, the number of combinations rises exponentially to 20+, or 
160,000, total combinations, and so on. 

Now, the number of combinatorial possibilities corresponding to a chain with 
four amino acids only marginally outstrips the combinatorial possibilities 


associated with the five-dial lock in my first illustration (160,000 vs. 100,000). It 
turns out, however, that many necessary, functional proteins in cells require far, 
far more than just four amino acids linked in sequence, and necessary genes 
require far, far more than just a few bases. Most genes—sections of DNA that 
code for a specific protein—consist of at least one thousand nucleotide bases. 
That corresponds to 4/°°°_an unimaginably large number—possible base 
sequences of that length. 

Moreover, it takes three bases in a group called a codon to designate one of 
the twenty protein-forming amino acids in a growing chain during protein 
synthesis. If an average gene has about 1000 bases, then an average protein 
would have over 300 amino acids, each of which are called “residues” by protein 
chemists. And indeed proteins typically require hundreds of amino acids in order 
to perform their functions. This means that an average-length protein represents 
just one possible sequence among an astronomically large number—20°", or 
over 10°°°—of possible amino-acid sequences of that length. Putting these 
numbers in perspective, there are only 10© atoms in our Milky Way galaxy and 
10°° elementary particles in the known universe. 

That is what bothered Eden and other mathematically inclined scientists at 
Wistar. They understood the immensity of the combinatorial spaces associated 
with even single genes or proteins of average length. They realized that if the 
mutations themselves were truly random—that is, if they were neither directed 
by an intelligence nor influenced by the functional needs of the organism (as 
neo-Darwinism stipulates)—then the probability of the mutation and selection 
mechanism ever producing a new gene or protein could well be vanishingly 
small. Why? The mutations would have to generate, or “search” by trial and 
error, an enormous number of possibilities—far more than were realistic in the 
time available to the evolutionary process. 

Eden pointed out in his Wistar presentation that the combinatorial space 
corresponding to an average-length protein (which he assumed to be about 250 
amino acids long) is 20*°°—or about 10°*°—possible amino-acid arrangements. 
Did the mutation and selection mechanism have enough time—since the 
beginning of the universe itself—to generate even a small fraction of the total 
number of possible amino-acid sequences corresponding to a single functional 
protein of that length? For Eden, the answer was clearly no. 

For this reason, Eden thought mutations had virtually no chance of producing 
new genetic information. He likened the probability of producing the human 
genome by relying on random mutations to that of generating a library of a 


thousand volumes by making random changes or additions to a single phrase in 
accord with the following instructions: “Begin with a meaningful phrase, retype 
it with a few mistakes, make it longer by adding letters [at random], and 
rearrange subsequences in the string of letters; then examine the result to see if 
the new phrase is meaningful. Repeat this process until the library is complete.”” 
Would such an exercise have a realistic chance of succeeding, even granting it 
billions of years? Eden thought not. 

In addition, Schiitzenberger emphasized that randomly cutting and pasting 
larger blocks of text, as evolutionary biologists often envision, would not make 
any appreciable difference to the efficacy of a random search of sequence space. 
Imagine a computer “mutating” at random the text of the play Hamlet either by 
individual-letter substitutions or by duplicating, swapping, inverting, or 
recombining whole sections of Shakespeare’s text. Would such a computer 
simulation have a realistic chance of generating a completely different and 
equally informative text such as, say, The Blind Watchmaker by Richard 
Dawkins, even granting multiple millions of undirected mutational iterations? 

Schiitzenberger didn’t think so. He noted that making random changes “at the 
typographic level” in a computer program inevitably degrades its function 
whether those changes are made “by letters or by blocks, the size of the unit 
really does not matter.”® Thus, he thought that a process of randomly shuffling 
blocks of text in any “typographic typology” would inevitably degrade meaning 
in much the same way that a series of individual-letter substitutions will. 

Schiitzenberger insisted that the evolutionary process faced similar 
limitations. To him, it seemed extremely unlikely that random mutations of 
whatever sort would produce significant amounts of novel and functionally 
specified information within the time available to the evolutionary process. 

Subsequent to the confirmation of Crick’s sequence hypothesis, all present at 
Wistar understood that the entities that confer functional advantages on 
organisms—new genes and their corresponding protein products—constitute 
long linear arrays of precisely sequenced subunits, nucleotide bases in the case 
of genes and amino acids in the case of proteins. Yet, according to neo- 
Darwinian theory, these complex and highly specified entities must first arise 
and provide some advantage before natural selection can act to preserve them. 
Given the number of bases present in genes, and amino acids present in 
functional proteins, a large number of changes in the arrangement of these 
molecular subunits would typically have to occur before a new functional and 
selectable protein could arise. For even the smallest unit of functional innovation 


—a novel protein—to arise, many improbable rearrangements of nucleotide 
bases would need to occur before natural selection had anything new and 
advantageous to select. 

Eden and others questioned whether mutations provided an adequate 
explanation for the origin of the genetic information necessary to build new 
proteins, let alone whole new forms of life. As physicist Stanislaw Ulam 
explained at the conference, the evolutionary process “seems to require many 
thousands, perhaps millions, of successive mutations to produce even the easiest 
complexities we see in life now. It appears, naively at least, that no matter how 
large the probability of a single mutation is, should it be even as great as one- 
half, you would get this probability raised to a millionth power, which is so very 
close to zero that the chances of such a chain seem to be practically non- 
existent.”? 


LOOKING FOR A LOOPHOLE 


In his presentation at the conference, Eden himself acknowledged a possible way 
of resolving this dilemma. He suggested that it was at least possible that 
“functionally useful proteins are very common in this [combinatorial] space so 
that almost any polypeptide one is likely to find [as the result of mutation and 
selection] has a useful function.”!° Many neo-Darwinian biologists subsequently 
came to favor this possible solution. The solution was this: even though the size 
of the combinatorial space that mutations needed to search was enormous, the 
ratio of functional to nonfunctional base or amino-acid sequences in their 
relevant combinatorial spaces might turn out to be much higher than Eden and 
others had assumed. If that ratio turned out to be high enough, then the mutation 
and selection mechanism would frequently stumble onto novel genes and 
proteins and could easily leapfrog from one functional protein island to the next, 
with natural selection discarding the nonfunctional outcomes and seizing upon 
the rare (but not too rare) functional sequences. 

As an electrical engineer who was used to working with computer code, Eden 
was intuitively disinclined to embrace this possibility. He noted that all codes 
and language systems can convey information precisely because they have rules 
of grammar and syntax. These rules ensure that not just any arrangement of 
characters will convey functional information. For this reason, functional 
sequences in working communications systems are typically surrounded in the 
larger combinatorial space by a multitude of nonfunctional sequences— 
sequences that don’t obey the rules. 

In known codes and language systems, functional sequences do indeed 
typically represent tiny islands of meaning amid a great sea of gibberish. 
Geneticist Michael Denton has shown that in English meaningful words and 
sentences are extremely rare among the set of possible combinations of letters of 
a given length, and they become proportionally rarer as sequence length grows.!! 
The ratio of meaningful 12-letter words to 12-letter sequences is 1/10"; the ratio 
of meaningful 100-letter sentences to possible 100-letter strings has been 
estimated as 1/10!°°. Denton used these figures in 1985 to explain why random 
letter substitutions inevitably degrade meaning in English text after only a few 
changes and why the same thing might be true of the genetic text. 

Given the alphabetic or “typographic” character of genetic information stored 
in DNA, Murray Eden and others at Wistar suspected that the same kind of 


problems would affect random mutational changes in DNA. It seemed logical 
that functional genes and proteins were also surrounded in their relevant 
combinatorial spaces by vast numbers of nonfunctional sequences—and, further, 
that the ratio of functional to nonfunctional sequences would also be exceedingly 
small. 

Yet in 1966 none of the scientists on either side of the debates at Wistar knew 
how rare or common functional gene and amino-acid sequences are among the 
corresponding space of total possibilities. Do they occur with a frequency of 1 in 
10, 1 in a million, or 1 in a million billion trillion? At the time, these questions 
could not be answered. 

Most evolutionary biologists remained optimistic that the answer to this 
question would vindicate the neo-Darwinian model.'* And some developments 
supported their confidence. During the late 1960s, molecular biologists learned 
that most of the functional roles performed by proteins are performed not just by 
one precise kind of protein, but by a wide variety, each with its own amino-acid 
sequence. This is unlike a bike lock, which has only one functional combination. 
Indeed, molecular biologists learned that though some amino acids at certain 
sites are absolutely essential for any particular protein to work, most sites 
tolerate amino-acid substitutions without loss of protein function. For many 
biologists, this suggested that mutation and selection had a reasonable chance of 
generating functional sequences of nucleotide bases or amino acids after all— 
that the ratio of functional to nonfunctional sequences was much higher than 
skeptics had anticipated. 

How much higher? How much variability is allowed in the amino-acid 
sequences in proteins? Are there enough functional proteins within a relevant 
combinatorial space of possibilities to render a random mutational search for 
new proteins plausible? 

When Denton compared linguistic and genetic text to explain the potential 
severity of the combinatorial inflation facing the neo-Darwinian mechanism, he 
noted that biologists still didn’t know enough “to calculate with any degree of 
certainty the actual rarity of functional proteins.” He concluded, however, that 
since future experiments surely would continue to deepen molecular biology’s 
fund of knowledge, “it may be that before long quite rigorous estimates may be 
possible.”!° 


IN SEARCH OF THE RATIO 


Denton’s prediction of imminent progress proved correct. During the late 1980s 
and early 1990s, Robert Sauer, a molecular biologist at MIT, performed a series 
of experiments that first attempted to measure the rarity of proteins within 
amino-acid sequence space. 

Sauer’s work exploited, for the first time, new technology that allowed for the 
systematic manipulation of gene sequences. Before the late 1970s, scientists 
typically used radiation and chemicals to produce mutant forms of DNA. 
Though these techniques sometimes paid off with dramatic results, such as 
mutant fruit flies with legs growing out of their heads (the famed Antennapedia 
mutation), they did not allow scientists to dictate or target any specific change to 
a sequence of bases in DNA. The treatments used simply replicated the 
conditions under which mutations occur naturally. 

During the late 1970s and early 1980s, however, molecular biologists 
developed technologies for making customized synthetic DNA molecules. 
Robert Sauer used these techniques to make site-directed changes to DNA 
sequences of specific genes of known function and then to insert those variants 
into bacterial cells. He could then evaluate the effect of various targeted 
alterations to a DNA sequence on the function of their protein products within a 
bacterial cell culture. 

Sauer’s technique allowed him to begin to evaluate how many of the variant 
sequences, as a percentage of the total, still produced a functional form of the 
relevant protein (see Fig. 9.3). His initial results confirmed that proteins could 
indeed tolerate a variety of amino-acid substitutions at many of the sites in the 
protein chain. Yet his experiments also suggested that functional proteins might 
be incredibly rare among the space of all possible amino-acid sequences. Based 
on one set of mutagenesis experiments, Sauer and his colleagues estimated the 
ratio of functional to nonfunctional amino-acid sequences at about 1 to 10°? for a 
short protein of 92 amino acids in length.'4 

This result was in rough agreement with an earlier estimate by information 
theorist Hubert Yockey.!° Yockey did not perform experiments to derive his 
estimate of the rarity of proteins in combinatorial sequence space. Instead, he 
used already published data to compare variants of the similar cytochrome c 
proteins (proteins involved in the biochemical pathways that generate energy in 
cells) in different species. He did this to see how much variability existed at each 


amino-acid site in molecules performing the same function with the same basic 
structure. Using this data about the allowable variability at each site, he 
estimated the probability of finding one of the allowable sequences among the 
total number of sequences corresponding to a cytochrome c protein 100 residues 
in length. He determined the ratio of functional to nonfunctional sequences to be 
about 1 to 10°° for amino-acid chains of this length.!° So, although Sauer’s 
experimentally derived results were numerically different from Yockey’s, both 
approaches gave extremely low ratios suggesting that functional proteins are 
indeed rare in sequence space, even if proteins do admit significant variability in 
the specific amino acids present at various positions. 
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FIGURE 9.3 

Figure 9.3a (top) depicts the problem of combinatorial inflation as it applies to 
proteins. As the number of amino acids necessary to produce a protein or protein 
fold grows, the corresponding number of possible amino-acid combinations 
grows exponentially. Figure 9.3b (bottom) poses graphically the question of the 
rarity of proteins in that vast amino-acid sequence space. 


Taken at face value, Sauer’s experiments appeared to yield contradictory 
conclusions. On the one hand, his results showed that many arrangements of 
amino acids could produce the same protein structure and function—that 
numerous amino-acid sequences populated amino-acid sequence space. On the 
other hand, the ratio of functional sequences to the total number of possible 
sequences corresponding to a sequence of roughly 100 amino acids appeared to 
be incredibly low, just 1 to 10°. 

Nevertheless, it’s not hard to see how both of Sauer’s seemingly contradictory 
conclusions could be true. Recall the locks confronting my hypothetical bike 
thief. Commercially manufactured bike locks typically have only one 
combination of digits that will allow them to be opened. The combination that 
will open a typical bike lock specifies one digit on each dial. No variability at 
any dial is allowed. 

Now imagine a new kind of lock with three crucial differences from an 
ordinary lock. First, with this new alternative lock, there are four positions on 
every dial that may—in combination with other positions on other dials—open 
the lock. My bike thief would like this feature of this kind of lock, since it seems 
to allow more wiggle room at each dial. But he doesn’t like the two other 
features of this lock. For one, each dial displays one of 20 letters rather than one 
of 10 numeric digits. Second, instead of 5 dials, there are 100 dials. On the 
upside, because 4 of the 20 letters on each of the 100 dials might work, there are 
4100 or a whopping 10, correct combinations that will open the lock. That’s an 
astronomically large number of correct combinations. But, on the downside, 
there are 20 possible settings at each of 100 dials, which computes to 20!, or 
10!%°, possible combinations, a number that totally dwarfs the number of correct 
combinations of dial settings. 

The bike thief would be happy to learn that every dial has “only” four 
possible correct positions on each dial, but in a trial-and-error process, he still 
has only 1 chance in 5 (4 out of 20) of landing on a possibly functional dial 
position for any dial; and this 1-in-5 chance must be negotiated 100 times. In 


other words, the 1-in-5 chance must be multiplied by the 100 dials on the 
monster lock in order to arrive at the probability of the thief stumbling upon a 
functional combination on a given try. The odds are 1 chance in 5!°° or—if we 
want to convert that to base 10—roughly 1 chance in 10”. The odds are that 
slender because the functional combinations—numerous as they are—are 
dwarfed by the number of total combinations. 

In the same way, Sauer established that though many different combinations 
of amino acids will produce roughly the same protein structure and function, the 
sequences capable of producing these functional outcomes are still extremely 
rare. He showed that for every functional 92-amino-acid sequence there are 
roughly another 10° nonfunctional sequences of the same length. To put that 
ratio in perspective, the probability of attaining a correct sequence by random 
search would roughly equal the probability of a blind spaceman finding a single 
marked atom by chance among all the atoms in the Milky Way galaxy—on its 
face clearly not a likely outcome.!” 


UNCERTAIN SITUATION 


Nevertheless, during the 1990s in the immediate wake of the publication of 
Sauer’s results, the implications of his work for evolutionary theory were not 
entirely clear. Even in the scientific paper in which Sauer reported his work, the 
abstract summarizing his results emphasized the tolerance to amino-acid 
substitution that proteins allow. Consequently, scientists on both sides of the 
discussion about Darwinian evolution seized on different aspects of Sauer’s 
findings either to support or challenge the plausibility of the neo-Darwinian 
account of the origin of genes and proteins. 

Scientists sympathetic to neo-Darwinism emphasized the tolerance of 
proteins to amino-acid substitution; critics of the theory emphasized the rarity of 
proteins in sequence space. One scientist, Ken Dill, a biophysicist at University 
of California, San Francisco, cited Sauer’s work to suggest that nearly any amino 
acid would work at any site in a protein chain, provided the amino acids in 
question exhibited the correct hydrophobicity (water-repelling) or hydrophilicity 
(water-attracting) properties. ® 

Yet, at least one scientist, Lehigh University biochemist Michael Behe, cited 
Sauer’s quantitative estimate of the rarity of proteins as a decisive refutation of 
the creative power of the mutation and selection mechanism altogether.'? So by 
the mid-1990s, though Sauer and his group had initiated a program of 
experimental research that addressed the key question that Murray Eden raised at 
Wistar, that question had still not been completely settled. Did the mutation and 
natural selection mechanism have a realistic chance of finding the new genes and 
proteins necessary to build, for example, a new Cambrian animal? Answering 
that would await an even more systematic and comprehensive experimental 
regime. 
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The Origin of Genes and Proteins 


As a Ph.D. student in chemical engineering at the California Institute of 
Technology in the late 1980s, Douglas Axe (see Fig. 10.1) became interested in 
evolutionary theory after several fellow graduate students read the then- 
bestselling book by Richard Dawkins, The Blind Watchmaker. Axe’s compatriots 
were quickly converted to zealous advocates of Dawkins’s arguments and urged 
him to read the book for himself. Axe was impressed by the clarity of Dawkins’s 
writing and illustrations, but he found his case for the creative power of natural 
selection and random mutations unpersuasive. Whether in the analogies he drew 
to animal breeding or the computer simulations he used to demonstrate the 
supposed ability of mutation and selection to generate new genetic information, 
Dawkins repeatedly smuggled in the very thing he insisted the concept of natural 
selection expressly precluded: the guiding hand of an intelligent agent. 

He found Dawkins’s computer simulation particularly interesting. In The 
Blind Watchmaker, Dawkins described how he had programmed a computer to 
generate the Shakespearean phrase: “Me thinks it is like a weasel.”! Dawkins did 
this in order to simulate how random mutations and natural selection could 
generate new functional information. 

Dawkins programmed the computer first to generate many separate strings 
(sequences) of English letters. He then programmed it to compare each string to 
the Shakespearean target phrase and select only the string that most closely 
resembled that target. The program then generated variant versions of that 
newly selected string and compared those sequences to the target, selecting, 
again, only the one that most closely resembled the desired target. This 
eventually generated—after many iterations—a string that matched the target 
perfectly. 

Axe recognized immediately the role that Dawkins’s own intelligence had 
played. Not only did Dawkins provide the program with the information that he 


wanted it to generate (“Me thinks it is like a weasel”), he imbued the computer 
with a kind of foresight by directing it to compare the variant sequences of 
letters with the desired target. Axe realized that Dawkins’s program did not 
simulate natural selection, which by definition is neither guided toward nor 
given information about a desired outcome generations in the future. 

Axe began to wonder if there was some other way to assess the creative 
power of the mutation and natural selection mechanism, not with clever 
analogies or simplistic computer simulations, but with experimental and 
mathematical rigor. 

Axe realized that Dawkins was right about one thing: the importance of 
genetic information. Like his fellow engineer Murray Eden, Axe’s tendency to 
view biology as an engineer led him to ask whether selection and mutation could 
actually build new organisms. Axe’s own research explored the connection 
between process control (a field of study in engineering) and genetic regulation, 
a sophisticated version of automated process control at work on a molecular 
scale inside living cells. Since cells use proteins to perform various feats of 
regulation, Axe was acutely aware that building new organisms necessarily 
involved building new proteins, which would in turn require new genetic 
information. 








FIGURE 10.1 
Douglas Axe. Courtesy Brittnay Landoe. 


But could mutation and selection generate the precise arrays of nucleotide 
bases necessary to build fundamentally new protein structures? Axe’s interest in 
this question eventually led him to the scientific papers of Robert Sauer and the 
proceedings of a seemingly obscure 1966 Philadelphia conference called 
“Mathematical Challenges to the Neo-Darwinian Interpretation of Evolution.” 


UNRESOLVED ISSUES 


As Axe read the papers that Sauer’s research group had produced, he realized 
their importance as a first step toward answering the questions Murray Eden had 
raised at Wistar. If Sauer’s quantitative measures of rarity held up, then Axe 
thought it obvious that mutation and selection could not adequately search a 
space that large. If, on the other hand, subsequent mutagenesis experiments 
overturned Sauer’s work and showed that protein function was largely 
indifferent to changes in amino-acid sequence, then the number of functional 
sequences might be large enough that mutation and selection would have a good 
chance of finding new functional genes and proteins in a reasonable amount of 
time. 

After completing his Ph.D., Axe made inquiries about doing postdoctoral 
research in a top research lab where he could address these unanswered 
questions. He was soon invited by Alan Fersht, a professor at the University of 
Cambridge and director of the Centre for Protein Engineering, part of the world- 
famous Medical Research Council (MRC) Centre at Cambridge, to join his 
research group. 

The decision to accept Fersht’s offer was an easy one. The star-studded 
history of the adjoining Laboratory for Molecular Biology (LMB), arguably the 
birthplace of molecular biology, included such luminaries as James Watson, 
Francis Crick, Max Perutz, John Kendrew, Sidney Brenner, and Fred Sanger. 
Starting in the chemistry department and then moving to the MRC Centre, Axe 
hoped to apply his research training to resolve the uncertainty that surrounded 
the interpretation of Sauer’s results. Specifically, he wanted to eliminate what he 
saw as two sources of error in Sauer’s method in order to get a more definitive 
estimation of the frequency of functional sequences in sequence space. 

Axe thought, first, that Sauer’s team might have underestimated the rarity of 
functional proteins. In their experiments, Sauer’s group tested the tolerance of 
proteins to amino-acid substitution by changing amino acids at one or a few 
consecutive sites without making any other changes to other sites at the same 
time, much like a typist introducing an isolated typographic error in an otherwise 
accurately transcribed text. Not surprisingly, Sauer and his colleagues found that 
many sites along a protein chain could tolerate these isolated amino-acid 
substitutions, just as the reader of a text with only a few typos can often make 
out its meaning. Sauer’s team seemed to be assuming that a similar tolerance 


would have emerged if they had changed many sites simultaneously. 

Axe thought that this assumption ignored the importance of the larger context 
provided by the mostly unaltered protein. A single typographic error typically 
will not totally destroy the meaning of a section of English text, because of the 
surrounding context provided by the other words, as well as the correct letters in 
the altered word. That does not mean, however, that specificity of sequence 
doesn’t matter. Instead, the meaning of a sentence with a typographic error can 
be discerned only because the rest of the letters are specifically arranged into 
meaningful words and phrases that provide a context for determining the 
meaning of the incorrectly spelled word. For just this reason, however, the 
meaning of a sentence is rapidly degraded if errors are allowed to accumulate at 
multiple sites. 

Axe wondered if much the same could be true of genes and proteins. He 
wondered whether multiple, as opposed to single, position changes would 
quickly degrade function and whether a tolerance for substitutions at individual 
sites was itself context dependent—whether the tolerance for substitution at one 
site might depend upon having highly specific sequences at other sites. Thus, 
without questioning Sauer’s experimental findings, Axe thought Sauer’s result 
lent itself to misinterpretation. To many molecular biologists, it suggested that 
proteins could readily accommodate many simultaneous changes to their amino- 
acid sequences at many positions and remain functional. 

As it turns out, Sauer recognized the potential for misinterpretation of his 
results. As he explained in the very paper in which he developed his quantitative 
estimate of rarity, “this calculation overestimates the number of functional 
sequences, since changes at individual positions are less likely to be independent 
of one another as more positions are allowed to vary.”° 

Another assumption in Sauer’s approach had potentially the opposite effect 
—exaggerating the rarity of functional proteins. Axe thought the test that Sauer 
and his colleagues used to decide whether their mutant proteins were functional 
required a higher level of function than natural selection might require. Sauer 
and his team judged proteins with less than about 5—10 percent of the function 
seen in the natural protein to be nonfunctional. Yet Axe knew that even damaged 
enzymes with less than 5 percent of normal activity could add significantly more 
benefit than no enzymatic activity at all. Thus, from a neo-Darwinian point of 
view, the emergence of even such handicapped proteins might confer a 
selectable advantage on an organism. Axe thought that by rejecting as 
nonfunctional such mutated sequences, Sauer’s team probably had introduced 


another estimation error. These competing errors made it hard to know if the 
estimate made by Sauer’s team was too high or too low or whether perhaps they 
might neatly cancel each other out. To eliminate both sources of possible error, 
Axe carefully designed a new series of experiments. 


THE IMPORTANCE OF FOLDS 


Axe had a key insight that animated the development of his experimental 
program. He wanted to focus on the problem of the origin of new protein folds 
and the genetic information necessary to produce them as a critical test of the 
neo-Darwinian mechanism. Proteins comprise at least three distinct levels of 
structure:* primary, secondary, and tertiary, the latter corresponding to a protein 
fold. The specific sequence of amino acids in a protein or polypeptide chain 
make up its primary structure. The recurring structural motifs such as alpha 
helices and beta strands that arise from specific sequences of amino acids 
constitute its secondary structure. The larger folds or “domains” that form from 
these secondary structures are called tertiary structures (see Fig. 10.2). 

Axe knew that as new life-forms arose during the history of life—in events 
such as the Cambrian explosion—many new proteins must also have arisen. 
New animals typically have new organs and cell types, and new cell types often 
call for new proteins to service them. In some cases new proteins, while 
functionally new, would perform their different functions with essentially the 
same fold or tertiary structure as earlier proteins. But more often, proteins 
capable of performing new functions require new folds to perform these 
functions. That means that explosions of new life-forms must have involved 
bursts of new protein folds as well. 
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FIGURE 10.2 

Different levels of protein structure. The first panel at the top shows the primary 
structure of a protein: a sequence of amino acids forming a polypeptide chain. 
The second panel depicts, in two different ways, two secondary structures: an 
alpha helix (left), and beta strands forming a beta sheet (right). The third panel at 
the bottom shows, in two different ways, a tertiary structure—that is, a protein 
fold. 


The late geneticist and evolutionary biologist Susumu Ohno noted that 
Cambrian animals required complex new proteins such as, for example, lysyl 
oxidase in order to support their stout body structures. When these molecules 
originated in Cambrian animals, they also likely represented a completely novel 
folded structure unlike anything present in Precambrian forms of life such as 
sponges or one-celled organisms. Thus, Axe was convinced that explaining the 
kind of innovation that occurred during the Cambrian explosion and many other 


events in the history of life required a mechanism that could produce, at least, 
distinctly new protein folds. 

He had another reason for thinking that the ability to produce novel protein 
folds provided a critical test for the creative power of the mutation and selection 
mechanism. As an engineer, Axe understood that building a new animal required 
innovation in form and structure. As a protein scientist, he understood that new 
protein folds could be viewed as the smallest unit of structural innovation in the 
history of life. 

It follows that new protein folds represent the smallest unit of structural 
innovation that natural selection can select. Of course, natural selection can 
operate on smaller units of change—individual amino-acid changes that result in 
slight functional advantages or fitness gains, but not new folds, for example. But 
what if the functional or fitness gains that natural selection preserves and passes 
on never generate structural innovations? What if, instead, it only preserves 
slight differences in the sequence or function of proteins that confer an 
advantage without altering structure? Then, clearly, fundamental changes in the 
form of an organism will not occur. Building fundamentally new forms of life 
requires structural innovation. And new protein folds represent the smallest 
selectable unit of such innovation. Therefore, mutations must generate new 
protein folds for natural selection to have an opportunity to preserve and 
accumulate structural innovations. Thus, Axe realized that the ability to produce 
new protein folds represents a sine qua non of macroevolutionary innovation. 

Could random mutations generate such novel protein folds? Axe realized that 
answering this question depended upon measuring the rarity of functional genes 
and proteins in sequence space and determining whether random genetic 
mutations would have enough opportunities to search the relevant sequence 
spaces within evolutionary time. 


AXE’S INITIAL RESULTS 


Axe read the paper by Sauer and his colleague John Reidhaar-Olson that 
estimated the proportion of functional protein sequences to be extremely low (1 
in 10°). He noticed that the authors chose not to emphasize this measure of 
rarity, however, but instead the variety of amino-acid substitutions that the 
protein under study could tolerate. 

In their paper, Reidhaar-Olson and Sauer also repeated a then popular idea 
that the amino acids buried in the interior of a folded protein (forming what is 


known as the hydrophobic core) are most important for specifying the structure, 
while the arrangement of the exterior amino acids did not matter nearly as 
much.° They thought that the amino acids buried within folded proteins typically 
need only to be hydrophobic (water repelling), whereas the amino acids on the 
exterior, for the most part, need to be hydrophylic (water attracting). In fact, 
some protein scientists thought that these simple restrictions might be the whole 
story—that a functional protein fold might require nothing more than an 
appropriate arrangement of hydrophobic and hydrophylic amino acids in a given 
sequence. 

At Fersht’s lab in Cambridge, Axe conducted an experimental test of this 
idea, and surprised himself with the first result. In a paper he coauthored in the 
Proceedings of the National Academy of Sciences in 1996, he reported his 
findings. When he replaced the entire thirteen-residue hydrophobic core of a 
small enzyme with random combinations of other hydrophobic amino acids, a 
high fraction of the randomized proteins (about one-fifth) still performed their 
original function. This suggested that proteins were, perhaps, less susceptible to 
functional loss as a result of sequence changes than Axe had thought. 

Next he focused on the exterior of proteins, randomizing portions of two 
different proteins’ exteriors in much the same way that he had randomly changed 
the interior of one of them. This time his approach failed to produce any 
functional variants at all. Realizing that this seemed to contradict what Sauer and 
others had supposed, Axe decided to make only much more restrictive changes 
in the next trial. He replaced each exterior amino-acid residue only with its most 
similar amino-acid alternative. Nevertheless, both of the proteins that he studied 
still lost all function by the time he had replaced one-fifth of their exterior 
residues. Thus he concluded that the exterior parts of the proteins were much 
more susceptible to functional loss as a result of amino-acid changes than had 
been widely assumed. 

In all this work, Axe designed his experiments to remedy the two sources of 
estimation error inherent in Sauer’s method. First, by studying amino-acid 
changes in combination rather than in isolation, he determined that the 
surrounding context typically did influence whether a particular amino-acid 
change at a particular site caused functional loss. In other words, he discovered 
that the neglect of the influence of the surrounding context had the effect of 
exaggerating the tolerance to amino-acid changes at particular sites, as he (and 
Sauer) had suspected. 

Second, the proteins that Axe chose for his study made it possible for 


function to be detected at much lower levels than was possible in Sauer’s 
studies. For one protein Axe studied, the more sensitive test of function did 
indeed allow a greater proportion of single mutants to retain some function, with 
about 95 percent of the mutant proteins achieving the designation “active.” This 
suggested that Sauer’s less sensitive screen did contribute to another source of 
estimation error, this time in the opposite direction. Nevertheless, Axe’s more 
sensitive screen also enabled him to establish that even though single mutations 
allow many proteins to retain some function, they still diminish or damage the 
function of the protein—often enough to ensure that they will be eliminated by 
the purifying effect of natural selection. Further, because of the extreme 
sensitivity of his test for function, Axe learned that any single mutation that 
failed his test was single-handedly destroying function. He determined that fully 
5 percent of such changes did destroy protein function. 

Overall, therefore, he showed that despite some allowable variability, proteins 
(and the genes that produce them) are indeed highly specified relative to their 
biological functions, especially in their crucial exterior portions. Axe showed 
that whereas proteins will admit some variation at most sites if the rest of the 
protein is left unchanged, multiple as opposed to single amino-acid substitutions 
consistently result in rapid loss of protein function. This was the case even when 
these changes occur at sites that allow variation when altered in isolation.° His 
new experiment also roughly confirmed Sauer’s earlier quantitative assessment 
of the rarity of functional proteins, despite the estimation errors inherent in 
Sauer’s method. Why? Because it appeared that Sauer’s two estimation errors— 
ignoring context and using an insufficiently sensitive screen for function—did, 
in fact, roughly cancel each other out. 

Despite these advances in understanding, Axe had not yet determined 
whether the greatly restricted picture of tolerance that his work had exposed 
would cause problems for the evolution of new protein folds. In order to answer 
that question he would need to obtain a more precise quantitative estimate of the 
rarity of proteins in sequence space. 

Having developed a method that eliminated the main sources of estimation 
error in earlier mutagenesis experiments, Axe was now in a position to answer 
that question with unprecedented rigor. Once he did, he could determine whether 
random genetic changes would have enough opportunities—even on the scale of 
evolutionary time—to search the relevant sequence spaces for functional genes 
and proteins. 


DON’T LEAVE THE FOLD 


Of course, Axe understood that neo-Darwinists do not envision a completely 
random journey through nucleotide or amino-acid sequence space. They see 
natural selection acting to preserve useful mutational variations and to eliminate 
deleterious ones. Richard Dawkins, for example, likens an organism to a high 
mountain peak.’ He compares climbing the sheer precipice up the front side of 
the mountain to building a new organism purely by chance—random mutations 
alone. He acknowledges that this approach up “Mount Improbable” will not 
succeed. Nevertheless, he asserts that there is a gradual slope up the back side of 
the mountain that can be climbed in small incremental steps. In his analogy, the 
back side up “Mount Improbable” corresponds to the process of natural selection 
acting on many small random changes in the genetic text. What chance alone 
cannot do, natural selection acting on random mutations can accomplish through 
the cumulative effect of many slight successive steps. 

Yet Axe’s experimental results presented a problem not only for scenarios 
involving random mutations acting alone, but also for scenarios envisioning 
selection and random mutation acting in concert. Further, his mutagenesis 
experiments cast doubt on each of the two scenarios by which evolutionary 
biologists might envision new protein folds (and the information necessary to 
produce them) arising as the result of the mutation and selection mechanism. 

In theory, new genes capable of producing a new protein fold might arise 
either from (a) preexisting genes or from (b) nonfunctional sections of the 
genome. That is—to adapt Dawkins’s visual analogy—mutation and natural 
selection could conceivably generate a new functional gene starting from either 
(a) another mountain peak (a different preexisting functional gene) or (b) from 
the valley floor (a nonfunctional section of the genome). Yet Axe’s experimental 
results would show that the action of natural selection would not help solve the 
search problem confronting the mutation mechanism in either of these two cases. 
To see why, we need to understand a bit more about each of these two possible 
neo-Darwinian scenarios as well as Axe’s subsequent experimental findings. 


FROM PEAK TO PEAK 


In the first case, evolutionary biologists might envision mutation and selection 
gradually altering a preexisting gene (and its protein product) to produce another 
functional gene (and a different protein product). This scenario involves moving 
metaphorically from one functional peak to another without dipping into a valley 
(a zone of diminished fitness or nonfunction). 

Most evolutionary biologists reject this first scenario.? They do so because 
they recognize that mutations in preexisting genes will typically degrade 
functional genetic information. They know, too, that when genes lose function, 
natural selection will eventually eliminate the organisms that possess these 
genes. Genes that contribute to the healthy function of an organism, that have 
been mutated in such a way as to diminish that function, will be subject to what 
evolutionary biologists call “purifying selection.” That is, natural selection will 
typically eliminate organisms possessing mutation-induced gene variants that 
diminish function or fitness. (When natural selection preserves genetic changes 
that enhance function or fitness, evolutionary biologists call that “positive 
selection.”) 

Axe’s mutagenesis experiments confirmed these reasons for doubting the first 
of the possible neo-Darwinian scenarios, at least as an explanation for the origin 
of new protein folds. In work that he published in 2000, he showed that it is, 
indeed, exceedingly difficult to make extensive changes to functional amino-acid 
sequences without destabilizing a protein fold. Even best-case changes involving 
the most chemically similar amino acids in the exterior of proteins tended to 
destabilize protein folds. 

In these experiments, Axe mutated a gene that produced a protein exhibiting a 
single fold and function. He found that as he altered this protein, multiple 
position changes in the exterior of the protein molecule quickly effaced or 
destroyed its function.? Yet to turn one protein with a distinctive folded structure 
into another with a completely novel structure and function requires specified 
changes at many, many sites—far more than Axe altered in his experiments.!° 
The number of changes necessary to produce a new protein fold typically 
exceeds the number of changes that will result in functional loss. Given this, the 
probability of the evolutionary process successfully traversing a functional 
landscape from one functional peak to another—all the while escaping 
functional loss each step along the way—is extremely small, with the probability 


diminishing exponentially with each additional requisite change.'! Indeed, by 
showing that functional proteins with distinct folds are far more sensitive to 
functional loss than protein scientists had previously assumed, Axe’s 
experiments confirmed what most evolutionary biologists suspected—namely, 
that protein-to-protein (or functional gene—to—functional gene) evolution is a no- 
go where the mutation and selection mechanism must produce a new protein fold 
(see Fig. 10.3). 

Axe had a more fundamental reason for considering the first evolutionary 
scenario implausible. Based on the physical principles of protein function, the 
vast majority of protein functions simply cannot be performed by unfolded 
proteins. In other words, stability of protein structure is a precondition of protein 
function. Destabilized protein folds not only lose the three-dimensional 
structures they need to perform functional tasks, they are also vulnerable to 
attack by other proteins called proteases that devour unfolded proteins or 
polypeptides in the cell.! 

As one structure is degraded as the result of multiple sequence changes, it 
will necessarily lose structural stability, resulting in a catastrophic loss of 
function. Yet any diminution in the function of a protein will also diminish 
fitness in a way that will subject the protein (and its corresponding gene) to the 
purifying action of natural selection.!° Indeed, according to the equations of 
population genetics, the standard mathematical expression of neo-Darwinian 
theory, even slight losses in fitness will subject the disadvantageous traits that 
produce such losses to purifying selection, thus eliminating them. That means 
that even many protein sequences that retain a significant, though diminished, 
portion of their original function nevertheless will not survive the winnowing 
effects of the neo-Darwinian mechanism. Thus, the gradual transformation of 
one functional fold into another was a complete nonstarter. 
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ZONE OF FUNCTION: TIME AND TIDE WAIT FOR NO MAN 





FIGURE 10.3 

This figure illustrates why many evolutionary biologists reject the idea that 
genes and proteins under selection pressure will evolve into new functional 
genes and proteins. Since genes, like English sentences, contain sequence- 
specific functional information, multiple changes in the genetic text will 
inevitably degrade function (or fitness) long before a new functional sequence 
will arise—just as random changes in a meaningful English sentence will 
typically destroy meaning long before such changes produce a significantly 
different meaningful sentence. 


Research performed at the European Molecular Biology Laboratory by 
molecular biologist Francisco Blanco has since confirmed this conclusion. Using 
site-directed mutagenesis, Blanco’s team found that the sequence space between 
two naturally occurring protein domains is not continuously populated by folded 
or functional proteins. By sampling intermediate sequences between two 
sequences that do adopt different folds, Blanco found that the intermediate 
sequences “lack a well-defined three-dimensional structure.” Thus, he concluded 
that “the appearance of a completely new fold from an existing one is unlikely to 
occur by evolution through a route of folded intermediate sequences.”!4 

Thus, both experimental results and the physics of protein folding implied 
that random searches for novel proteins starting from preexisting protein-coding 
genes would result in functional loss long before a protein with a novel fold 
would emerge, as most evolutionary biologists already suspected. Although the 
first of the two possible evolutionary scenarios has the advantage of starting on a 
mountain peak—with a functional gene and protein—it also has a lethal 
disadvantage: randomly mutating the gene will soon destabilize a protein fold 
and/or generate nonfunctional intermediate sequences and structures long before 
a new gene (capable of generating a new fold) would arise. For this reason, this 
scenario involves not so much a climb up Mount Improbable, but a step out over 
Valley Impassable. 


SCALING MOUNT IMPROBABLE 


For all these reasons, like most evolutionary biologists, Axe thought the second 
neo-Darwinian scenario—in which new genes and proteins emerge from 
nonfunctional or neutral regions of the genome—provides a much more 
plausible means of producing the information necessary to construct novel 
protein folds. It was to this scenario that Axe turned his experimental energies. 

In this scenario, neo-Darwinists envision new genetic information arising 
from sections of the genetic text that can vary freely without consequence to the 
organism. According to this scenario, noncoding sections of the genome or 
duplicated sections of coding regions undergo a protracted period of “neutral 
evolution”!’ in which alterations in nucleotide sequences have no discernible 
effect on the fitness of the organism. Functional genes and proteins gradually 
rise from a nonfunctional valley floor to a functional mountain peak—generating 
a new gene. Natural selection plays a role, but not until a new functional gene 
has arisen. 

Evolutionary biologists typically picture this process beginning with a gene 
duplication event. Although several different mechanisms can generate gene 
duplicates in DNA,!° the most common mechanism occurs during the crossing- 
over step of meiosis (a kind of cell division that produces sex cells, or gametes, 
in sexually reproducing organisms). During meiosis, homologous chromosomes 
swap segments of DNA. In a normal crossing-over event, corresponding 
chromosomal segments of equal size are swapped between the two homologous 
chromosomes, ensuring that both chromosomes experience no net gain or loss of 
genes. Sometimes, however, chromosomes swap genetic material of unequal 
length. When this happens, one chromosome (the one that gets the smaller piece) 
ends up losing some DNA, while the chromosome that receives the larger 
segment of genetic material ends up with a new stretch of chromosomal DNA— 
one that may include a gene or genes it already had. This results in duplicate 
copies of a gene on one chromosome. 

When this occurs, one of the two genes may begin to vary—to experience 
mutations—without adversely affecting the function of the organism, while the 
other performs the original function. In the jargon of evolutionary biology, 
mutational changes in gene duplicates are “selectively neutral”—they initially 
provide no advantage or disadvantage to an organism or population. These gene- 
duplication events allow nature room to experiment safely. Unhelpful but 


harmless genetic novelties can be passed on to future generations, where 
additional mutations one day may render the evolving genetic material useful. 
Eventually, as mutational changes accumulate, a new gene sequence may arise in 
a new organism that can code for a novel protein fold and function. At that point, 
natural selection can favor the new gene and its protein product, preserving and 
passing it along to future generations—or so the story goes. 

This scenario—which many evolutionary biologists now refer to as the 
“classical model” of gene evolution—has the advantage of allowing portions of 
the genome to vary freely through many generations, giving mutations many 
opportunities to “search” the space of possible base sequences without being 
punished for drifting into valleys of lost or diminished function. 

But this scenario faces an overriding problem: the extreme rarity of sequences 
capable of forming stable folds and performing biological functions. Since 
natural selection does nothing to help generate new folded, functional 
sequences, but rather can only preserve such sequences once they have arisen, 
random mutations alone must search for the exceedingly rare folded and 
functional sequences within the vast sea of combinatorial possibilities. 

And that is the big story associated with Axe’s experiments. His research 
showed that folded, functional sequences of amino acids are indeed exceedingly 
rare within sequence space. After his initial round of experiments, Axe 
performed another series of site-directed mutagenesis experiments on a 150- 
amino-acid protein-folding domain within a B-lactamase enzyme and published 
the results in the Journal of Molecular Biology.’ Recall that a folding domain is 
a portion of a larger protein that exhibits a distinctive fold. Since amino-acid 
chains must first fold into stable three-dimensional structures, Axe performed 
experiments that enabled him to estimate the frequency of sequences that will 
produce stable folds—any stable fold—before he estimated the frequency of 
sequences performing a specific (f-lactamase) function. His improved 
experimental method produced a precise quantitative result. He estimated (a) the 
number of 150-amino-acid-long sequences capable of folding into stable 
“function-ready” folded structures compared to (b) the whole set of possible 
amino-acid sequences of that length (recall Fig. 9.3). Based on his site-directed 
mutagenesis experiments, he determined that ratio to be a vanishingly small 1 in 
10”. In other words, for sequences 150 amino acids long, only 1 in 10% 
sequences will be capable of folding into a stable protein. 

For a sequence to achieve a protein fold is only a first step, however. A 
protein must be folded to be functional, but a folded protein is not necessarily a 


functional protein. And although sequences capable of forming stable protein 
folds are necessary to any significant evolutionary innovation, natural selection 
cannot select for the presence of a fold unless it also performs a function that 
confers a specific functional advantage on an organism. Thus, Axe also 
estimated (a) the number of proteins of modest length (150 residues) that 
perform a specified function via any folded structure compared to (b) the whole 
set of possible amino-acid sequences of that size. Based on his experiments and 
data about the number of stable folded proteins that exist, Axe estimated that 
ratio to be about 1 to 107”. A telling conclusion follows from this experimental 
data: The probability of any given mutational trial generating (or “finding”) a 
specific functional protein among all the possible 150 residue amino-acid 
sequences is 1 chance in 10’’—that is, one chance in one hundred thousand, 
trillion, trillion, trillion, trillion, trillion, trillion. 

That is obviously an incredibly small probability, but is it small enough to 
justify rejecting the classical model of gene evolution? Or is it plausible to think 
that random mutations in the nonfunctional part of the genome could overcome 
these long odds to generate the genetic information necessary to produce a novel 
protein fold with a specific selectable function? 


HOW MANY TRIALS? 


When statisticians or scientists assess whether a chance hypothesis provides a 
plausible explanation for the occurrence of an event, they do not just evaluate the 
probability of that particular event occurring once; they evaluate the probability 
of the event occurring given the number of opportunities it has to occur. 

For example, if our hypothetical bike thief from the previous chapter had 
enough time to try more than half (more than 500 of the 1000) total 
combinations of a three-dial bike lock, then the probability that he will stumble 
upon the right combination will exceed the probability that he will fail. In that 
case, it will be more likely than not that he will succeed in opening the lock by 
chance. In that case, the chance hypothesis—the hypothesis that he will succeed 
in opening the lock by chance—is more likely to be true than false. On the other 
hand, if just after he started trying to crack the lock he heard a security guard 
coming around the comer and only had time to explore a small fraction of the 
total number of possible combinations—far fewer than half—then it will be 
much more likely than not that he will fail to open the lock by chance. 
Consequently, anyone who knew his situation could conclude that the chance 


hypothesis is, in that case, much more likely to turn out false than true. 

When statisticians or scientists assess the probability of an event occurring by 
chance, they often assess what is called a conditional probability. In deciding the 
plausibility of a chance hypothesis, they assess the probability of the event given 
or “conditioned on” what else we know, especially what else is known about the 
number of opportunities the event has to occur. And they refer to the number of 
opportunities an event has to occur as “the probabilistic resources.” !® 

If the conditional probability of the chance hypothesis, given the number of 
opportunities it has to occur, is less than %, then it is more likely than not that the 
event will not happen by chance. It will be viewed as implausible—more likely 
to be false than true. Conversely, if the conditional probability of the chance 
hypothesis, given the number of opportunities it has to occur, is more than 4, 
then it is more likely than not that the event in question will occur by chance. It 
will be deemed plausible—more likely to be true than false. And, of course, the 
smaller the conditional probability associated with a hypothesis, the more 
implausible the hypothesis—the more likely the chance hypothesis is to be false 
than true. 

How then should we assess the chance hypothesis for the origin of biological 
information—in particular, the hypothesis that random mutations generated the 
information necessary to produce a novel protein fold with a selectable function? 
What is the conditional probability that such a folded protein could arise as the 
result of random mutations in duplicated nonfunctional sections of a genome? 
Axe realized that in order to answer that question he needed a way to estimate 
the number of opportunities that random mutations had for producing a new 
protein fold with a selectable function during the whole history of life on earth. 


THE BIOLOGICAL UNIVERSAL PROBABILITY BOUND 


Here, evolutionary theory itself provides the answer. Axe was interested in the 
number of times that mutations might have produced new sequences of bases in 
DNA that were capable of producing a new sequence of amino acids—one of the 
10’” possible sequences in the relevant sequence space. Yet not every such 
sequence of bases that mutations might generate constituted a relevant trial. In 
theory, mutations may alter a gene many times during the life cycle of an 
organism. Nevertheless, natural selection can only act on the new sequence of 
bases that is actually passed on to offspring. It might seem that it would be 
difficult to quantify the number of mutational trials in each generation. But even 
if we think of mutations repeatedly shuffling and reshuffling the arrangement of 
bases during the life cycle of an organism, only those mutations in the genes (or 
DNA) in the reproductive cells of parent organisms can have any effect on the 
next generation. Since Axe wanted to know how many novel sequences capable 
of generating a selectable function might have arisen in the history of life, he 
only needed to concern himself with those sequences that could be transmitted 
during reproduction. 

This meant that if he could estimate the total number of organisms that had 
lived during the history of life on earth and the number of new genes that 
mutations might produce and pass on to the next generation, he could establish 
an upper bound on the number of trials relevant to the evolutionary process. 

Axe knew that the huge population sizes of prokaryotes like bacteria dwarf 
the population sizes of all other organisms combined. Thus, estimates for the 
size of the bacterial population—plus a smidge for everything else—would 
approximate the size of the number of organisms living at any given time. Based 
on the average length of time of a bacterial generation and the time since the first 
appearance of bacterial life on earth (3.8 billion years ago), scientists have 
estimated that a total of about 10*° organisms have lived on earth since life first 
appeared.!? Axe made the assumption that each new organism received one new 
sequence of bases (one potential gene) capable of generating one of the possible 
amino-acid sequences in sequence space per generation. 

This was an extremely generous assumption. Since mutations have to be quite 
rare for life to survive, most bacterial cells inherit an exact copy of their parent’s 
DNA. Furthermore, the ones that differ from their parents are likely to carry a 
mutation that has already occurred many times in other cells. For these reasons, 


the actual number of new sequences sampled in the history of life is much lower 
than the total number of bacterial cells that have existed. Nevertheless, Axe 
assumed that one new gene per organism has been transmitted to the next 
generation. Thus, he used 10*° gene sequences as a liberal estimate of the total 
number of gene sequences (evolutionary trials) that have been generated to 
search sequence space in the history of life. 

Even so, 10*° represents only a tiny fraction—1 ten trillion, trillion, trillionth 
—of 10’’. Thus, the conditional probability of generating a gene sequence 
capable of producing a novel protein fold and function is still only 1 in 10°”. 
This means that if every organism from the dawn of time had generated, by 
random mutation, one new base sequence in the sequence space of interest, that 
would amount to only one 10 trillion, trillion, trillionth of the sequences in that 
space—the space that needs to be searched.And, since conditional probability of 
a new gene arising in the manner envisioned by the classical model turns out to 
be almost unimaginably less than %, the classical model turns out to be vastly 
more likely to be false than true. Thus, Axe concluded that a reasonable person 
should reject it. The probabilistic resources available to the classical model of 
gene evolution are simply far too small to tame 1 chance in 10’ (see Fig. 10.4). 
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FIGURE 10.4 
The top panel in this diagram represents the results of Axe’s mutagenesis 
experiments showing the extreme rarity of functional proteins in sequence space. 
Based on his experiments Axe estimated that there are 10’” possible sequences 











corresponding to a specific functional sequence 150 amino acids long. The 
second panel shows that functional amino-acid sequences are extremely rare 
even in relation to the total number of opportunities the evolutionary process 
would have had to generate novel sequences (on the assumption that each 
organism that has ever lived during the history of life produced one such 
sequence per generation). 


To appreciate why this model fails, consider the following illustration.After 
the 1975 Steven Spielberg film Jaws became a big hit, one small-town motel 
advertised “Shark-Free Pool.” Proponents of the second evolutionary scenario 
(gene duplication, followed by neutral evolution), envision an evolutionary pool 
where there are no consequences for mutational missteps—by analogy, a pool 
with no predators. But to extend the illustration, picture a predator-free pool the 
size of our galaxy. Now picture a blindfolded man dropped into the middle of it. 
He must swim to the far side, to the one spot on the edge of the pool where a 
ladder would give him a way out. He’s safe from predators, but it will do him no 
good. He needs direction, some way of gauging his progress, and an immense 
amount of time. But he has none of these and so he will arrive at the ladder in 
neither a hundred years, nor a hundred billion. Similarly, in the classical model 
of gene evolution, random mutations must thrash about aimlessly in immense 
combinatorial space, a space that could not be explored by this means in the 
entire history of life on earth, let alone in the few million years of the Cambrian 
explosion. 


TO BUILD AN ANIMAL 


Yet Axe’s calculations only hint at the full problem for neo-Darwinian theory. 
By bending over backwards not to overstate the improbability of generating a 
new protein fold and by focusing narrowly on that one aspect of the challenge 
confronting evolutionary theory, his figures vastly understate the improbability 
of building a Cambrian animal. There are several reasons for this. 

First, the Cambrian explosion as dated by fossil evidence took far less time 
than has elapsed since the origin of life on earth until the present (about 3.8 
billion years).*° Less time available for a given evolutionary transition means 
fewer generations of new organisms and fewer opportunities to generate new 
genes by which to search relevant sequence space. This makes it even harder to 
generate a new protein fold by chance in the relevant time period. 

Second, bacteria are by far the most common type of organisms included in 
Axe’s estimate of the total number of organisms that have lived on earth. Yet no 
one thinks that Cambrian animals evolved directly from bacteria. Nor does 
anyone think that the putative multicellular ancestors of the Cambrian forms 
would have been anywhere near as abundant as the bacterial populations that 
Axe used as the main basis of his estimate. A more realistic estimate for the 
number of possible animal ancestors would necessarily result in a much lower 
estimate for the number of gene sequences available for searching sequence 
space (corresponding to a single protein of modest length in a single Cambrian 
animal). Recall that, based on Axe’s estimates, the probability of generating just 
one gene (for a new functional protein fold) from all the bacteria (and other 
organisms) that have ever lived on earth is just 1 in 10 trillion, trillion, trillion. 
Consequently, the mechanism for searching sequence space that Axe determined 
to be extremely implausible must be judged far more implausible as a 
mechanism for producing the Cambrian information explosion, since there were 
far fewer multicellular organisms present in the Precambrian period than there 
have been total organisms present in the entire history of life. 

Third, building new animal forms requires generating far more than just one 
protein of modest length. New Cambrian animals would have required proteins 
much longer than 150 amino acids to perform necessary, specialized functions.*! 
For example, as previously noted, many of these Cambrian animals needed the 
complex protein lysyl oxidase to support their stout body structures. In addition 
to a novel protein fold, these molecules (in living organisms) comprise over 400 


precisely sequenced (nonrepeating) amino acids. Reasonable extrapolation from 
mutagenesis experiments done on shorter protein molecules suggests that the 
improbability of randomly producing functionally sequenced proteins of this 
length would be extremely unlikely to occur given the probabilistic resources 
(and duration) of the entire universe.?~ 

The mutation and selection mechanism faces a related obstacle. The 
Cambrian animals exhibit structures that would have required many new types of 
cells, each requiring many novel proteins to perform their specialized functions. 
But new cell types require not just one or two new proteins, but coordinated 
systems of proteins to perform their distinctive cellular functions. The unit of 
selection in such cases ascends to the system as a whole. Natural selection 
selects for functional advantage, but no advantage accrues from a new cell type 
until a system of servicing proteins is in place. But that means random mutations 
must, again, do the work of information generation without the help of natural 
selection—and, now, not simply for one protein, but for many proteins arising 
together. Yet the odds of this occurring by chance alone are, of course, far 
smaller than the odds of the chance origin of a single new gene or protein—so 
small as to render the chance origin of the information needed to build a new cell 
type fantastically improbable (and implausible) given even the most optimistic 
estimates for the length of the Cambrian explosion. 

Richard Dawkins has noted that scientific theories can rely on only so much 
“luck” before they cease to be credible.** But the second scenario, involving 
gene duplication and neutral evolution, by its own logic, precludes natural 
selection from playing a role in generating genetic information until after the 
fact. Thus, it relies entirely on “too much luck.” The sensitivity of proteins to 
functional loss, the rarity of proteins within combinatorial sequence space, the 
need for long proteins to build new cell types and animals, the need for whole 
new systems of proteins to service new cell types, and the brevity of the 
Cambrian explosion relative to rates of mutation—all conspire to underscore the 
immense implausibility of any scenario for the origin of Cambrian genetic 
information that relies upon random variation alone, unassisted by natural 
selection. 

Yet the classical model of gene evolution—which relies on neutral evolution 
—requires novel genes and proteins to arise, precisely, by random mutation 
alone. Adaptive advantage accrues after the generation of new functional genes 
and proteins. Natural selection cannot play a role until new functional 
information-bearing molecules have independently arisen. Thus, to return to 


Dawkins’s imagery, evolutionary theorists envisioned the need to scale the steep 
face of a precipice of which there is effectively no gradually sloping back side, 
since the smallest increment of structural innovation in the history of life—a new 
protein fold—itself presents a formidable Mount Improbable. 

By the way, Axe’s later experiments establishing the extreme rarity of protein 
folds in sequence space also show why random changes to existing genes 
inevitably efface or destroy function before they generate fundamentally new 
folds or functions (scenario one). If only one out of every 10”’ of the alternate 
sequences are functional, an evolving gene will inevitably wander down an 
evolutionary dead-end long before it can ever become a gene capable of 
producing a new protein fold. The extreme rarity of protein folds also entails 
their isolation from each other in sequence space. 


A CATCH-22 


Douglas Axe’s results highlight an acute dilemma for neo-Darwinism, a “catch- 
22.” On the one hand, if natural selection plays no role in generating new genes, 
as the idea of neutral evolution implies, then mutations alone must climb a 
Mount Improbable in a single leap—a situation that, given Axe’s results and 
Dawkins’s own logic, is probabilistically untenable. On the other hand, any 
model for the origin of genetic information that envisions a significant role for 
natural selection, by assuming a preexisting gene or protein under selective 
pressure, encounters other equally intractable difficulties. The evolving genes 
and proteins will range over a series of disadvantageous or nonfunctional 
intermediates that natural selection will not favor or preserve, but will, instead, 
eliminate. At that point, selection-driven evolution will cease, locking existing 
genes and proteins into place. 

Thus, whether one envisions the evolutionary process beginning with a 
preexisting functional gene or a duplicated noncoding region of the genome, the 
results of mutagenesis experiments present a precise quantitative challenge to the 
efficacy of the neo-Darwinian mechanism. Indeed, our growing knowledge 
about the rarity and isolation of proteins and functional genes in sequence space 
implies that neither neo-Darwinian scenario for producing new genes is at all 
plausible. Thus, neo-Darwinism does not explain the Cambrian information 
explosion. 
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Assume a Gene 


When I first heard that Douglas Axe had succeeded in making a rigorous 
estimate of the rarity of proteins in sequence space, I wondered what neo- 
Darwinists would say in response. Given the experimental rigor and 
mathematical precision of the work he reported in the Journal of Molecular 
Biology in 2004, and the long odds against mutation and selection ever finding a 
novel gene or functional protein, what could they say? That the probability of a 
successful search for new genes and proteins was higher than Axe’s experiments 
suggested? That his methods or calculations were flawed? That no one else had 
gotten similar results? Since Axe’s work confirmed other analyses and 
experiments, and since his paper had passed through the careful scrutiny of peer 
review, none of those responses seemed plausible. Yet defenders of the adequacy 
of the neo-Darwinian mechanism were far from admitting defeat, as I would 
soon find out. 

The same year, I published a peer-reviewed scientific article about the 
Cambrian explosion and the problem of the origin of the biological information 
needed to explain it.! In the paper, I cited Axe’s results and explained why the 
rarity of functional proteins in sequence space posed such a severe challenge to 
the adequacy of the neo-Darwinian mechanism. The article appeared in a 
biology journal, Proceedings of the Biological Society of Washington, published 
out of the Smithsonian Institution by scientists working for the Smithsonian’s 
National Museum of Natural History (NMNH). Because the article also argued 
that the theory of intelligent design could help explain the origin of biological 
information (see Chapter 18), its publication created a firestorm of controversy. 

Museum scientists and evolutionary biologists from around the country were 
furious with the journal and its editor, Richard Sternberg, for allowing the article 
to be peer-reviewed and then published. Recriminations followed. Museum 
officials took away Sternberg’s keys, his office, and his access to scientific 


samples. He was transferred from a friendly to a hostile supervisor. A 
congressional subcommittee staff later investigated and found that museum 
officials initiated an intentional disinformation campaign against Sternberg in an 
attempt to get him to resign. His detractors circulated false rumors: “Sternberg 
has no degrees in biology” (actually he has two Ph.D.’s, one in evolutionary 
biology and one in systems biology); “He is a priest, not a scientist” (Sternberg 
is not a priest, but a research scientist); “He is a Republican operative working 
for the Bush campaign” (he was far too busy doing scientific research to be 
involved in political campaigns, Republican or otherwise); “He’s taken money to 
publish the article” (not true); and so on. Eventually, despite the demonstrable 
falsehood of the charges, he was demoted.” 

Major news stories about the controversy appeared in Science, Nature, The 
Scientist, and the Chronicle of Higher Education.° Then articles appeared in the 
mainstream press, including the Washington Post and the Wall Street Journal.* A 
major story aired on National Public Radio.° Sternberg himself even appeared on 
The O’Reilly Factor. 

Despite the intense furor, there was no formal scientific response to my 
article: neither the Proceedings nor any other scientific journal published a 
scientific refutation. The members of the Council of the Biological Society of 
Washington who oversaw the publication of the journal insisted that they didn’t 
want to dignify it by responding. 

Eventually two scientists and a science education policy advocate—each 
associated with the National Center for Science Education, a group that lobbies 
for teaching evolution in the public schools—stepped forward. The three authors 
—geologist Alan Gishlick, education policy advocate Nicholas Matzke, and 
wildlife biologist Wesley R. Elsberry—published a response to my article on 
TalkReason.org, a prominent atheistic website.° Although the website’s 
guidelines prohibit “ad hominem arguments,” the rule was somewhat loosely 
enforced in the case of Gishlick, Matzke, and Elsberry’s response, which they 
titled “Meyer’s Hopeless Monster.” 

Gishlick, Matzke, and Elsberry attempted to refute my central argument by 
citing a scientific paper that they said solved the problem of the origin of genetic 
information. The paper, a scientific review essay titled “The Origin of New 
Genes: Glimpses from the Young and Old,” had appeared in Nature Reviews 
Genetics in 2003. Gishlick, Matzke, and Elsberry asserted that this paper— 
coauthored by Manyuan Long, an evolutionary biologist at the University of 
Chicago, and several colleagues—was representative of an extensive “scientific 


literature documenting the origin of new genes.”’ 


Other biologists echoed Gishlick, Matzke, and Elsberry’s claim in the context 
of another public controversy. During the 2005 Kitzmiller v. Dover trial about an 
ill-advised attempt to require teachers in a Pennsylvania school district to read a 
Statement about intelligent design, Brown University biologist Kenneth Miller 
cited Long’s paper in his testimony. He said that it shows how new genetic 
information evolves. The judge in the case, John E. Jones, then cited Miller’s 
testimony about Long’s article in his own decision. Judge Jones asserted there 
are “more than three dozen peer-reviewed scientific publications showing the 
origin of new genetic information by evolutionary processes.”® Elsewhere 
Matzke, along with biologist Paul Gross, stated that the paper by Long “reviews 
all the mutational processes involved in the origin of new genes and then lists 
dozens of examples in which research groups have reconstructed the genes’ 
origins.”’ In their view, “Competent scientists know how new genetic 
information arises.” !° 

But do evolutionary biologists really know this? 

Let’s take a closer look at the article that allegedly shows “how new genetic 
information arises.” !! 


ONCE UPON A GENE THE OFT-CITED LONG PAPER POINTS TO A 
VARIETY OF STUDIES THAT PURPORT TO EXPLAIN THE EVOLUTION 
OF VARIOUS GENES. THESE STUDIES TYPICALLY BEGIN BY TAKING 

A GENE AND THEN SEEKING TO FIND OTHER GENES THAT ARE 

SIMILAR (OR HOMOLOGOUS) TO IT. THEY THEN SEEK TO TRACE 

THE HISTORY OF SLIGHTLY DIFFERENT HOMOLOGOUS GENES BACK 
TO A HYPOTHETICAL COMMON ANCESTOR GENE (OR GENES). TO 
DO THIS, THE STUDIES SURVEY DATABASES OF GENE SEQUENCES 

LOOKING FOR SIMILAR SEQUENCES IN REPRESENTATIVES OF 
DIFFERENT TAXONOMIC GROUPS—OFTEN IN CLOSELY RELATED 

SPECIES. SOME STUDIES ALSO ATTEMPT TO ESTABLISH THE 

EXISTENCE OF A COMMON ANCESTOR GENE ON THE BASIS OF 
SIMILAR GENES WITHIN THE VERY SAME ORGANISM. THEY THEN 
TYPICALLY PROPOSE EVOLUTIONARY SCENARIOS IN WHICH AN 

ANCESTRAL GENE DUPLICATES ITSELF,!* AND THEN THE 
DUPLICATE AND THE ORIGINAL EVOLVE DIFFERENTLY AS THE 
RESULT OF SUBSEQUENT MUTATIONS IN EACH GENE. 


Next, these scenarios invoke various kinds of mutations—duplication events, 
exon shuffling, retropositioning, lateral gene transfer, and subsequent point 
mutations—as well as the activity of natural selection (see Fig. 11.1). The 
evolutionary biologists conducting these studies postulate that modern genes 
arose as the result of these various mutational processes—processes that they 
envision as having shaped genes during a long evolutionary history. Since the 
information in modern genes is presumably different from the information in the 
hypothetical ancestor genes, they regard the mutational mechanisms that are 
allegedly responsible for these differences as the explanation for the origin of 
genetic information. 

Upon closer examination, however, none of these papers demonstrate how 
mutations and natural selection could find truly novel genes or proteins in 
sequence space in the first place; nor do they show that it is reasonably probable 
(or plausible) that these mechanisms would do so in the time available. These 
papers assume the existence of significant amounts of preexisting genetic 
information (indeed, many whole and unique genes) and then suggest various 
mechanisms that might have slightly altered or fused these genes together into 
larger composites. At best, these scenarios “trace” the history of preexisting 
genes, rather than explain the origin of the original genes themselves (see Fig. 
11.2). 

This kind of scenario building can suggest potentially fruitful avenues of 
research. But an obvious error comes in mistaking a hypothetical scenario for 
either a demonstration of fact or an adequate explanation. None of the scenarios 
that the Long paper cites demonstrate the mathematical or experimental 
plausibility of the mutational mechanisms they assert as explanations for the 
origin of genes. Nor do they directly observe the presumed mutational processes 
in action. At best, they provide hypothetical, after-the-fact reconstructions of a 
few events out of a sequence of many supposed events, starting with the 
existence of a presumed common ancestor gene. But that gene itself does not 
represent a hard data point. It is inferred to have existed on the basis of the 
similarity of two or more other existing genes, which are the only actual pieces 
of observational evidence upon which these often elaborate scenarios are based. 
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FIGURE 11.1 
Various types of mutations that are alleged to result in the modification of genes: exon shuffling, 
retropositioning, lateral gene transfer, and gene fusion. 
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FIGURE 11.2 
Depiction of how gene duplication and subsequent gene evolution might take 


place. While the gene on the bottom remains under selective pressure and cannot 
experience many mutations without loss of fitness or function (see Figure 10.3), 
the duplicated gene at the top can in theory vary without deleterious 
consequences to the organism. 


That these scenarios depend on various inferences and postulations doesn’t, 
by itself, disqualify them from consideration. Nevertheless, whether they 
adequately explain the origin of genetic information depends upon the evidence 
for the existence of the entities they infer (the ancestral genes) and the 
plausibility of the mutational mechanisms they postulate. Let’s look at both parts 
of these scenarios. 


COMMON ANCESTOR GENES? 


Nearly all of the scenarios developed in the papers that Long cites start with an 
inferred common ancestral gene from which two or more modern genes diverged 
and developed. These scenarios treat the similarity of sequence (the information) 
in two or more genes as unequivocal evidence for a common ancestral gene (see 
Fig. 11.2). As I noted in Chapters 5 and 6, standard methods of phylogenetic 
reconstruction presuppose, rather than demonstrate, that biological similarity 
results from shared ancestry. Yet, as we saw in Chapter 6, similarity of sequence 
by itself is not always an unequivocal indicator of common ancestry. Sometimes 
similarity appears between species where it cannot be explained by inheritance 
from a common ancestor (e.g., the similar forelimbs on moles and mole crickets) 
and, at the very least, there are other possible explanations for sequence 
similarity. 

In the first place, similar gene sequences might have evolved independently 
on two parallel lines of descent starting with two different genes, as the 
hypothesis of convergent evolution asserts. Recent examples of convergent 
genetic evolution now abound in the literature of molecular and evolutionary 
biology.'? For example, molecular biologists have discovered that both whales 
and bats use similar systems—involving similar genes and proteins—for 
echolocation. The striking similarity of these systems used in two otherwise 
disparate mammalian species has led biologists to posit the parallel evolution of 
echolocation, including the gene sequences and proteins that make it possible, 
from a common ancestor that did not possess this system.!4 

In addition, it is possible that similar genes might have been separately 


designed to meet similar functional needs in different organismal contexts. 
Viewed this way, similarity of sequence does not necessarily reflect descent with 
modification from a common ancestor, but could reflect design in accord with 
common functional considerations, constraints, or goals. I recognize, of course, 
that to this point I have not given any independent reasons for considering the 
design hypothesis, and that, as a hypothesis for sequence similarity by itself, 
intelligent design may not seem compelling. (For more compelling reasons to 
consider intelligent design, see Chapters 17 through 19.) Nevertheless, I mention 
both these other possible explanations for the similarity of gene sequences in 
order to demonstrate that sequence similarity does not necessarily indicate, or 
derive from, a common ancestral gene. 


ORFAN GENES SOME GENES, AND THE INFORMATION-RICH 
SEQUENCES THEY CONTAIN, MOST CERTAINLY CANNOT BE 
EXPLAINED BY REFERENCE TO THE KIND OF SCENARIOS THAT 
LONG CITES. ALL OF THESE SCENARIOS ATTEMPT TO EXPLAIN THE 
ORIGIN OF TWO SIMILAR GENES BY REFERENCE TO DESCENT WITH 
MODIFICATION (VIA MUTATION) FROM COMMON ANCESTRAL 
GENES. YET GENOMIC STUDIES ARE NOW TURNING UP HUNDREDS 
OF THOUSANDS OF GENES IN MANY DIVERSE ORGANISMS THAT 
EXHIBIT NO SIGNIFICANT SIMILARITY IN SEQUENCE TO ANY OTHER 
KNOWN GENE." THESE “TAXONOMICALLY RESTRICTED GENES” OR 
“ORFANS” (FOR “OPEN READING FRAMES OF UNKNOWN ORIGIN”) 
NOW DOT THE PHYLOGENETIC LANDSCAPE. ORFANS HAVE TURNED 
UP IN EVERY MAJOR GROUP OF ORGANISMS, INCLUDING PLANTS 
AND ANIMALS AS WELL AS BOTH EUKARYOTIC AND PROKARY OTIC 
ONE-CELLED ORGANISMS. IN SOME ORGANISMS, AS MUCH AS ONE- 
HALF OF THE ENTIRE GENOME COMPRISES ORFAN GENES.!° 


Thus, even if it could be assumed that similar gene sequences always point to 
a common ancestor gene, these ORFan genes cannot be explained using the kind 
of scenarios that Long’s article cites. Since ORFans lack sequence similarity to 
any known gene—that is, they have no known homologs in even distantly 
related species—it is impossible to posit a common ancestral gene from which a 
particular ORFan and its homolog might have evolved. Remember: ORFans, by 
definition, have no homologs. These genes are unique—one of a kind—a fact 
tacitly acknowledged by the increasing number of evolutionary biologists who 


attempt to “explain” the origin of such genes through de novo (“out of 
nowhere”) origination. 

Some might argue that as biologists map the sequence of more genomes and 
add more gene sequences to protein databases, homologs of these ORFans will 
eventually turn up, thus gradually eliminating the mystery surrounding the 
ORFan phenomenon. Yet to date the trend has gone in the opposite direction. As 
scientists have explored and sequenced more genomes, they have discovered 
more and more ORFans without finding anything like a corresponding number 
of homologs. Instead, the number of “unpaired” ORFan genes continues to grow 
with no sign of the trend reversing itself.'” 


THE PLAUSIBILITY OF THE MUTATIONAL PROCESSES EVEN IF 
EVOLUTIONARY BIOLOGISTS COULD ESTABLISH THE EXISTENCE 
OF THE COMMON ANCESTRAL GENES FROM WHICH THEIR 
SCENARIOS BEGIN, THAT WOULD NOT ESTABLISH THE 
PLAUSIBILITY OF A NEO-DARWINIAN MECHANISM FOR 
GENERATING GENETIC INFORMATION FROM THAT ANCESTOR. 
MOREOVER, THE TERM “PLAUSIBILITY” IN THIS CONTEXT HAS A 
SPECIFIC SCIENTIFIC AND METHODOLOGICALLY SIGNIFICANT 
MEANING. STUDIES IN THE PHILOSOPHY OF SCIENCE SHOW THAT 
SUCCESSFUL EXPLANATIONS IN HISTORICAL SCIENCES SUCH AS 
EVOLUTIONARY BIOLOGY NEED TO PROVIDE “CAUSALLY 
ADEQUATE” EXPLANATIONS—THAT IS, EXPLANATIONS THAT CITE A 
CAUSE OR MECHANISM CAPABLE OF PRODUCING THE EFFECT IN 
QUESTION. IN ON THE ORIGIN OF SPECIES, DARWIN REPEATEDLY 
ATTEMPTED TO SHOW THAT HIS THEORY SATISFIED THIS 
CRITERION, WHICH WAS THEN CALLED THE VERA CAUSA (OR “TRUE 
CAUSE”) CRITERION. IN THE THIRD CHAPTER OF THE ORIGIN, FOR 
EXAMPLE, HE SOUGHT TO DEMONSTRATE THE CAUSAL ADEQUACY 
OF NATURAL SELECTION BY DRAWING ANALOGIES BETWEEN IT 
AND THE POWER OF ANIMAL BREEDING AND BY EXTRAPOLATING 
FROM OBSERVED INSTANCES OF SMALL-SCALE EVOLUTIONARY 
CHANGE OVER SHORT PERIODS OF TIME. 


In this, Darwin hewed to a principle of scientific reasoning that one of his 
scientific role models, the great geologist Charles Lyell, used as a guide for 
reasoning about events in the remote past. Lyell insisted that good explanations 


for the origin of geological features should cite “causes now in operation”— 
causes known from present experience to have the capacity to produce the 
effects under study.!® 

Do the scenarios developed by various evolutionary biologists cited in the 
Long review essay meet this criterion? Duplication mutations and various other 
modes of random mutational change along with natural selection clearly 
constitute “causes now in operation.” No one disputes that. But have these 
processes demonstrated the capacity to produce the effect in question, namely, 
the genetic information necessary to structural innovation in the history of life? 
There are several good reasons to think that they have not. 


BEGGING QUESTIONS FIRST, MOST OF THE MUTATIONAL 
PROCESSES THAT EVOLUTIONARY BIOLOGISTS INVOKE IN THE 
SCENARIOS CITED IN THE LONG ESSAY PRESUPPOSE SIGNIFICANT 
AMOUNTS OF PREEXISTING GENETIC INFORMATION ON 
PREEXISTING GENES OR MODULAR SECTIONS OF DNA OR RNA. THE 
LONG ESSAY HIGHLIGHTS SEVEN MAIN MUTATIONAL MECHANISMS 
AT WORK IN THE SCULPTING OF NEW GENES: (1) EXON SHUFFLING, 
(2) GENE DUPLICATION, (3) RETROPOSITIONING OF MESSENGER RNA 
TRANSCRIPTS, (4) LATERAL GENE TRANSFER, (5) TRANSFER OF 
MOBILE GENETIC UNITS OR ELEMENTS, (6) GENE FISSION OR 
FUSION, AND (7) DE NOVO ORIGINATION (SEE FIG. 11.1). YET EACH 
OF THESE MECHANISMS, WITH THE EXCEPTION OF DE NOVO 
GENERATION, BEGINS WITH PREEXISTING GENES OR EXTENSIVE 
SECTIONS OF GENETIC TEXT. THIS PREEXISTING FUNCTIONALLY 
SPECIFIED INFORMATION IS IN SOME CASES ENOUGH TO CODE FOR 
THE CONSTRUCTION OF AN ENTIRE PROTEIN OR A DISTINCT 
PROTEIN FOLD. MOREOVER, THESE SCENARIOS NOT ONLY ASSUME 
UNEXPLAINED PREEXISTING SOURCES OF BIOLOGICAL 
INFORMATION, THEY DO SO WITHOUT EXPLAINING OR EVEN 
ATTEMPTING TO EXPLAIN HOW ANY OF THE MECHANISMS THEY 
ENVISION COULD HAVE SOLVED THE COMBINATORIAL SEARCH 
PROBLEM DESCRIBED IN CHAPTERS 9 AND 10. 


A closer look at each of these mechanisms will show why scenarios that rely 
on them beg important questions about the origin of genetic information. 
Advocates of exon shuffling envision modular sections of a genome 


randomly arranging and rearranging themselves to generate entirely new genes, 
not unlike rearranging whole paragraphs in an essay to generate a new article. In 
genomes that have regions that code for the production of proteins interspersed 
with regions that do not code for proteins, the term “exon” refers to a protein- 
coding region of the genome. These protein-coding regions of the genome are 
often interrupted by nonprotein-coding sections of the genome (called introns) 
that serve other functions, such as coding for the production of regulatory RNAs. 
In any case, exons store significant quantities of preexisting functionally 
specified information. 

Though most proteins are encoded by multiple exons, a single exon may 
encode a substantial unit of protein structure, such as a functional protein fold— 
a fact that advocates of exon shuffling count in their own attempts to explain 
novel proteins. They assume that exons can be blindly shuffled and mixed 
around to form genes. Nevertheless, this mechanism cannot produce new protein 
folds. Either an exon is large enough that it already encodes a protein fold—in 
which case it’s not creating a new fold—or it’s too small, small enough that 
multiple exons must be combined in order to form a stable protein fold. In this 
latter case, other problems—in particular, something called adverse side-chain 
interaction—will preclude success, as we will see. 

Evolutionary scenarios envisioning other mutational mechanisms also 
presuppose important sources of preexistent genetic information. Gene 
duplication, as the name implies, involves the production of a duplicate copy of 
a preexisting gene, already rich in functionally specified information. 
Retropositioning of messenger RNA transcripts occurs when an enzyme called 
reverse transcriptase takes a preexisting strand of messenger RNA and inserts its 
corresponding DNA sequence into a genome, also producing a duplicate of the 
coding portion of a preexisting gene. Lateral gene transfer involves transferring a 
preexisting gene from one organism (usually a bacterium) into the genome of 
another. The transfer of mobile genetic elements likewise occurs when 
preexisting genes enclosed in circular strands of DNA called plasmids enter one 
organism from another and eventually find themselves incorporated into a new 
genome. This process also mainly occurs in single-celled organisms. A similar 
process can occur in eukaryotes, where mobile genetic elements called 
transposons—often called “jumping genes”!°—can hop from place to place in 
the genome. Gene fusion occurs when two adjacent preexisting genes, each rich 
with specified genetic information, link together after the deletion of intervening 
genetic material.”?° 


Each of these six mutational mechanisms presupposes preexisting modules of 
specified genetic information. Some of these mutational mechanisms also 
depend upon sophisticated preexistent molecular machines such as the reverse 
transcriptase enzyme used in retropositioning or other complex cellular 
machinery involved in DNA replication. Since building these machines requires 
other sources of genetic information, scenarios that presuppose the availability 
of such molecular machines to assist in the cutting, splicing, or positioning of 
modular sections of genetic information clearly beg the question. 

Overall, what evolutionary biologists have in mind is something like trying to 
produce a new book by copying the pages of an existing book (gene duplication, 
lateral gene transfer, and transfer of mobile genetic elements), rearranging blocks 
of text on each page (exon shuffling, retropositioning, and gene fusion), making 
random spelling changes to words in each block of text (point mutations), and 
then randomly rearranging the new pages. Clearly, such random rearrangements 
and changes will have no realistic chance of generating a literary masterpiece, let 
alone a coherent read. That is to say, these processes will not likely generate 
specificity of arrangement and sequence and, therefore, do not solve the 
combinatorial search problem. In any case, all such scenarios also beg the 
question. There is a big difference between shuffling and slightly altering 
preexisting sequence-specific modules of functional information and explaining 
how those modules came to possess information-rich sequences in the first place. 


EVOLUTION EX NIHILO? 


Long does cite at least one type of mutation that does not presuppose existing 
genetic information, the de novo origination of new genes. For example, one 
paper he discusses sought to explain the origin of a promoter region for a gene 
(the part of the gene that helps initiate the transcription of the gene’s 
instructions) and found that “this unusual regulatory region did not really 
‘evolve.’ ” Instead, it somehow snapped into being: “It was aboriginal, created 
de novo by the fortuitous juxtaposition of suitable sequences.”! 

Many other papers invoke de novo origination of genes. Long mentions, for 
example, a study seeking to explain the origin of an antifreeze protein in an 
Antarctic fish that cites “de novo amplification of a short DNA sequence to 
spawn a novel protein with a new function.”*? Likewise, Long cites an article in 
Science to explain the origin of two human genes involved in neurodevelopment 
that appealed to “de novo generation of building blocks—single genes or gene 


segments coding for protein domains,” where an exon spontaneously “originated 
from a unique noncoding sequence.”** Other papers make similar appeals. A 
paper in 2009 reported “the de novo origin of at least three human protein- 
coding genes since the divergence with chimp[s],” where each of them “has no 
protein-coding homologs in any other genome.”** An even more recent paper in 
PLoS Genetics reported “60 new protein-coding genes that originated de novo on 
the human lineage since divergence from the chimpanzee,”*° a finding that was 
called “a lot higher than a previous, admittedly conservative, estimate.”° 

Another 2009 paper in the journal Genome Research was appropriately titled 
“Darwinian Alchemy: Human Genes from Noncoding RNA.” It investigated the 
de novo origin of genes and acknowledged, “The emergence of complete, 
functional genes—with promoters, open reading frames (ORFs), and functional 
proteins—from ‘junk’ DNA would seem highly improbable, almost like the 
elusive transmutation of lead into gold that was sought by medieval 
alchemists.”*” Nonetheless, the article asserted without saying how that: 
“evolution by natural selection can forge completely new functional elements 
from apparently nonfunctional DNA—the process by which molecular evolution 
turns lead into gold.”78 

The presence of unique gene sequences forces researchers to invoke de novo 
origin of genes more often than they would like. After one study of fruit flies 
reported that “as many as ~12% of newly emerged genes in the Drosophila 
melanogaster subgroup may have arisen de novo from noncoding DNA,”’? the 
author went on to acknowledge that invoking this “mechanism” poses a severe 
problem for evolutionary theory, since it doesn’t really explain the origin of any 
of its “nontrivial requirements for functionality.”°° The author proposes that 
“preadaptation” might have played some role. But that adds nothing by way of 
explanation, since it only specifies when (before selection played a role) and 
where (in noncoding DNA), not how the genes in question first arose. Details 
about how the gene became “preadapted” for some future function is never 
explained. Indeed, evolutionary biologists typically use the term “de novo 
origination” to describe unexplained increases in genetic information; it does not 
refer to any known mutational process. 

Taking stock, then, many of the mutational processes that Long cites either: 
(1) beg the question as to the origin of the specified information contained in 
genes or parts of genes, or (2) invoke completely unexplained de novo jumps— 
essentially evolutionary creation ex nihilo (“from nothing”). 


Thus, ultimately, the scenarios featured in Long’s review essay do not explain 
the origin of the specified information in either genes or sections of genes. That 
would require a cause capable of solving the combinatorial inflation problem 
discussed in the previous chapters. But none of the scenarios discussed in Long’s 
article even addresses this problem, let alone demonstrates the mathematical 
plausibility of the mechanisms they cite. Yet, Gishlick, Matzke, and Elsberry 
originally cited Long as a definitive refutation of my article—the one in which I 
argued that the rarity of genes and proteins in sequence space cast doubt on the 
power of selection and mutations to generate novel genetic information. 
Professor Miller, in his testimony at the celebrated Dover trial, even convinced a 
federal judge to affirm that Long had succeeded in demonstrating how genetic 
information originates in a celebrated legal ruling. Clearly, one cannot solve a 
problem or refute an argument by failing to address it.*! 


PROTEIN FOLDS: PLAUSIBLE BUT IRRELEVANT SCENARIOS 
THERE IS ASECOND AND CLOSELY RELATED DIFFICULTY 
ASSOCIATED WITH THE SCENARIOS CITED BY LONG. TYPICALLY, 
THEY DO NOT EVEN TRY TO EXPLAIN THE ORIGIN OF NEW PROTEIN 
FOLDS, AND FEW OF THEM ANALYZE GENES DIFFERENT ENOUGH 
FROM EACH OTHER THAT THEIR PROTEIN PRODUCTS COULD, EVEN 
CONCEIVABLY, EXEMPLIFY DIFFERENT FOLDS. INSTEAD, THEY 
USUALLY ATTEMPT TO EXPLAIN THE ORIGIN OF HOMOLOGOUS 
GENES—GENES THAT PRODUCE PROTEINS WITH THE SAME FOLDED 
STRUCTURE PERFORMING THE SAME FUNCTION OR A CLOSELY 
RELATED ONE. 


For example, Long cites one study comparing the two genes RNASE1 and 
RNASE1B, which code for homologous digestive enzymes.** The two proteins 
perform nearly the same function: breaking down RNA molecules in the 
digestive tracts of colobine leaf-eating monkeys, though each does its job at a 
slightly different optimal chemical pH. More important, given that the amino- 
acid sequences of the two enzymes are 93 percent identical, structural biologists 
would expect both enzymes to utilize the same protein fold to accomplish their 
closely related tasks. 

Long also references a study of a gene that codes for a histone protein, Cid, in 
two closely related species of fruit flies—Drosophila melanogaster and 
Drosophila simulans. The study didn’t try to explain the “origin” of the gene—it 


merely compared the gene in the two species, catalogued some minor differences 
between them, and asked how those differences arose. The study identified some 
two dozen nucleotide differences between the genes for Cid in the two species— 
only 17 of which might have changed an amino acid in the sequence out of 226 
total amino acids in the Cid protein.*’ Such a slight (7.5 percent) difference 
would be extremely unlikely to translate into different protein folds. Indeed, 
natural sequences known to have different folds do not have anything like the 
correspondingly high degree (92.5 percent) of sequence identity. Instead, known 
natural sequences with this high level of sequence identity have the same fold. 

Long also cites two studies of FOXP2, a gene involved in regulating gene 
expression in humans, chimps, other primates, and mammals. In humans and 
other mammals, this gene is involved in brain development.** Nevertheless, 
according to one study, the protein coded for by this gene in humans acquired 
just “two amino-acid changes on the human lineage”*? during the entire course 
of its evolution from a common chimp-human ancestor—again, not likely a 
sufficient enough change to generate a new protein fold. 

The Long review essay cites numerous scenarios of this type—scenarios 
attempting to explain the evolution of slight gene variants (and their similar 
proteins), not the origin of new protein folds. This is an important distinction 
because, as we saw in Chapter 10, new protein folds represent the smallest unit 
of selectable structural innovation, and much larger structural innovations in the 
history of life depend upon them. Explaining the origin of structural innovation 
requires more than just explaining the origin of variant versions of the same gene 
and protein or even the origin of new genes capable of coding for new protein 
functions. It requires producing enough genetic information—truly novel genes 
—to produce new protein folds. 

Thus, even where these scenarios are plausible, they are not relevant to 
explaining the origin of the genetic information necessary to produce the kind of 
structural innovation that occurs in the Cambrian explosion (or many other 
events in the history of life). 


PROTEIN FOLDS: RELEVANT BUT IMPLAUSIBLE SCENARIOS IN A 
FEW CASES, THE EVOLUTIONARY SCENARIOS CITED IN THE LONG 
PAPER APPEAR TO BE ATTEMPTS TO EXPLAIN GENES THAT ARE 
DIFFERENT ENOUGH FROM EACH OTHER THAT THEY COULD 
CONCEIVABLY CODE FOR PROTEINS WITH DIFFERENT FOLDS. FOR 
EXAMPLE, LONG DISCUSSES SEVERAL PAPERS THAT EQUATE EXON 


SHUFFLING WITH THE SHUFFLING OF PROTEIN DOMAINS. RECALL 
THAT A PROTEIN DOMAIN IS A STABLE “TERTIARY” PROTEIN 
STRUCTURE OR FOLD MADE OF MANY SMALLER “SECONDARY” 
STRUCTURES SUCH AS ALPHA HELICES OR BETA STRANDS (SEE FIG. 
10.2). MANY COMPLEX PROTEINS HAVE NUMEROUS DOMAINS, 
EACH EXHIBITING A UNIQUE FOLD OR TERTIARY STRUCTURE. ONE 
VERSION OF THE EXON-SHUFFLING HYPOTHESIS ASSUMES THAT 
EACH EXON CODES FOR A SPECIFIC PROTEIN DOMAIN. IT 
ENVISIONS RANDOM CUTTING AND SPLICING—EXCISING, 
SHUFFLING, AND RECOMBINATION—OF THE EXON PORTIONS OF 
THE GENOME, RESULTING IN THE MODULAR REARRANGEMENT OF 
GENETIC INFORMATION. THE RESULTING COMPOSITE GENE WILL 
THEN CODE FOR A NEW COMPOSITE PROTEIN STRUCTURE. AS 
LONG PROPOSES, “EXON SHUFFLING, WHICH IS ALSO KNOWN AS 
DOMAIN SHUFFLING, OFTEN RECOMBINES SEQUENCES THAT 
ENCODE VARIOUS PROTEIN DOMAINS TO CREATE MOSAIC 
PROTEINS.”?° 


Of the mechanisms that Long discusses, exon shuffling (and the closely 
related idea of gene fusion) provides perhaps the most plausible means of 
generating new (composite) proteins.*’ Nevertheless, the idea that exon shuffling 
can explain the origin of the genetic information necessary to produce new 
protein folds or whole composite proteins is problematic for several reasons. 

First, the exon-shuffling hypothesis seems to assume that each exon involved 
in the process codes for a protein domain that folds into a distinctive tertiary 
structure. To a protein scientist, a protein domain is equivalent to a protein fold, 
though distinct protein structures (folds) may be composed of several smaller 
domains (or folds). Thus, at the very least, the exon-shuffling hypothesis 
presupposes the prior existence of a significant amount of genetic information— 
enough information to build an independent protein domain or fold. As such, it 
fails as an explanation for the origin of protein folds and the information 
necessary to produce them. 

Some advocates of exon shuffling, however, may be using the term “protein 
domain” in a slightly fuzzier way. They might be equating domains with smaller 
structural units such as fragments of a fold made of several units of secondary 
structure such as alpha helices or beta strands. Conceived this way, the exon- 
shuffling hypothesis would then entail the construction of a new protein structure 


by combining these smaller “fragments.” 

But in most cases, if the amino-acid chain that forms a domain is chopped 
into fragments, then the resulting isolated pieces would cease to retain their 
original shapes. Why? Because the three-dimensional shape of one small section 
of a protein is heavily dependent upon the overall structure and shape of the rest 
of the protein. Snip out a section or fragment, or synthesize a fragment in 
isolation from the rest of the protein, and a floppy amino-acid chain will result— 
one that has entirely lost its original shape, or ability to form a stable structure. 
Thus, this version of the exon-shuffling hypothesis lacks credibility because it 
incorrectly assumes that shapeless protein fragments can be mixed and matched 
in a modular fashion to form new stable, functional protein folds. Moreover, 
even if such shuffling were physically plausible, this version of the hypothesis 
would have another problem. It still presupposes unexplained functional 
information—in particular, the information necessary to specify, not just the 
smaller fragments, but also the information required to arrange these smaller 
units into stable folds, and ultimately functional proteins. 

Second, since the exon-shuffling hypothesis assumes that each exon involved 
in the shuffling codes for a specific protein domain, it also assumes that exon 
boundaries correspond to the boundaries of protein domains or folds. In existing 
genes, however, exon boundaries do not typically correspond to the boundaries 
of folded domains within the larger proteins.°° If the shuffling of exons 
explained how actual proteins had come into existence, then there should be a 
clear correspondence or correlation between exon boundaries within genes and 
the corresponding protein domains within larger composite structures (i.e., 
whole proteins). The absence of such a correspondence suggests that exon 
shuffling does not account for the origin of known compound protein structures. 

Third, relying on exon shuffling to cobble together a new protein fold from 
smaller units of protein structure is physically implausible for another reason. To 
see why, we need to examine what a “side chain” is. All twenty protein-forming 
amino acids have a common backbone (made of nitrogen, carbon, and oxygen), 
but each one has a different chemical group called a side chain sticking out at 
roughly right angles from that backbone. The interactions between side chains 
determine whether secondary units of protein structure made from chains of 
amino acids will fold into larger stable three-dimensional folds.29 Though many 
different sequences will generate secondary structures (alpha helices and beta 
strands), generating stable folds is much more difficult and requires much more 
specificity in the arrangement of amino acids and their side chains. Specifically, 


since the elements in smaller secondary structural units in proteins are 
surrounded by side chains, they cannot be combined into new folds unless the 
elements have the sequence specificity required for the side chains to 
complement one another.*? That means smaller secondary structural units will 
rarely*! fuse together to form stable tertiary structures or folds. Instead, attempts 
to form new folds from smaller units of structure repeatedly encounter adverse 
interactions between the side chains of the amino acids within units of secondary 
structure. 

The need for extreme specificity in the sequential arrangements of amino 
acids, discussed in the previous chapter, means that the overwhelming majority 
of amino-acid sequences in units of secondary structure will not result in stable 
folds as these units of structure come into contact with each other. As discussed 
in Chapter 10, the extreme rarity of functional proteins (with stable folds) in 
sequence space ensures that the probability of finding a correct fold-stabilizing 
sequence will be astonishingly small. For this reason, even skilled protein 
scientists have struggled to design sequences that will produce stable protein 
folds.** Almost invariably the units of secondary structure that they attempt to 
combine or otherwise place into stable composite structures will not fold 
because of the interactions of their amino-acid side chains.*? As molecular 
biologist Ann Gauger explains, “Thus, [alpha] helices and [beta] sheets are 
sequence-dependent structural elements within protein folds. You can’t swap 
them around like Lego bricks.”“4 

Nor is it an easy matter to simply find different sequences of amino acids that 
will stabilize folds from smaller secondary units of structure, again, because of 
the extreme rarity of functional (and folding) sequences within amino-acid 
sequence space. Generating specific sequences that will fold into stable 
structures, whether in the lab or during the history of life, requires solving the 
combinatorial inflation problem. Even small folds will require five or six units of 
secondary structure with 10 or so amino acids in each unit, that is, 60 or more 
precisely sequenced amino acids. Modest-size folds will require a dozen or more 
units of secondary structure and 150 to 200 specifically arranged amino acids in 
order to stabilize a fold. Larger protein folds will require many more secondary 
units and specifically arranged amino acids. Since, however, many mission- 
critical functions within even the simplest cell require many folds (of at least 150 
amino acids) working in close coordination, the need to produce proteins of at 
least this length numerous times through the history of life cannot be avoided. 

All this requires searching for a functional needle in a vast haystack of 


combinatorial possibilities. Recall that Douglas Axe estimated the ratio of 
needles (functional sequences) to strands of straw in the haystack (nonfunctional 
sequences) to be 1 to 10’ for sequences of modest-length (150 amino acids). 

Of course, in naturally occurring proteins, the interactions between side 
chains in units of secondary structure do maintain stable folds. But these 
proteins, with their stable three-dimensional folded structures, depend upon 
exceedingly rare and precisely arranged sequences of amino acids. The question 
is not whether the combinatorial search problem necessary to produce stable 
protein folds has ever been solved, but whether a neo-Darwinian mechanism 
relying on random mutations (in this case random shuffling of exons) provides a 
plausible explanation for how it might have been solved. 

The papers that Long cites give no reason to think that exon shuffling (or any 
other mutational mechanism) has solved this problem. The exon-shuffling 
hypothesis ignores the need for side-chain specificity, though the need for such 
specificity has repeatedly defeated attempts in the laboratory to build new 
proteins from units of secondary structure in the manner required. 

But advocates of exon shuffling make no attempt to show how random 
rearrangements of protein domains—whether the domains are conceived of as 
fragments of a fold or whole folds—would solve the combinatorial problem. Nor 
do they challenge Sauer’s or Axe’s experimentally derived quantitative estimates 
of the rarity of functional genes or proteins. They do not challenge the 
probability calculations based on these estimates. And they do not show that a 
mechanism exists that can search amino-acid sequence spaces more effectively 
or efficiently than random mutation and selection. Nor do they demonstrate the 
efficacy of exon shuffling in a model system in the laboratory. Instead, basic 
considerations of protein structure imply the implausibility of exon shuffling as a 
means of generating the genetic information necessary to produce a new protein 
fold.*° So, in the end, with few words and with apparent confidence, advocates 
of exon shuffling simply assert, as the Long paper does, that “exon shuffling 
often recombines sequences that encode various protein domains to create 
mosaic proteins.” 


WORD SALAD THE ASSERTION OF LONG AND HIS COLLEAGUES 
ABOUT EXON SHUFFLING, LIKE MANY OTHER STATEMENTS ABOUT 
POSTULATED MUTATIONAL MECHANISMS, BLURS THE DISTINCTION 

BETWEEN THEORY AND EVIDENCE. DESPITE THE AUTHORITATIVE 
TONE OF SUCH STATEMENTS, EVOLUTIONARY BIOLOGISTS RARELY 


DIRECTLY OBSERVE THE MUTATIONAL PROCESSES THEY ENVISION. 
INSTEAD, THEY SEE PATTERNS OF SIMILARITIES AND DIFFERENCES 
IN GENES AND THEN ATTRIBUTE THEM TO THE PROCESSES THEY 
POSTULATE. YET THE PAPERS THAT LONG CITES OFFER NEITHER 
MATHEMATICAL DEMONSTRATION, NOR EXPERIMENTAL EVIDENCE, 
OF THE POWER OF THESE MECHANISMS TO PRODUCE SIGNIFICANT 
GAINS IN BIOLOGICAL INFORMATION. 


In the absence of such demonstrations, evolutionary biologists have taken to 
offering what one biologist I know calls “word salad”—jargon-laced 
descriptions of unobserved past events—some possible, perhaps, but none with 
the demonstrated capacity to generate the information necessary to produce 
novel forms of life. This genre of evolutionary literature envisions exons being 
“recruited”“° and/or “donated”*’ from other genes or from an “unknown 
source”*®; it appeals to “extensive refashioning’*’ of genes; it attributes 
“fortuitous juxtaposition of suitable sequences”? to mutations or “fortuitous 
acquisition”?! of promoter elements; it assumes that “radical change in the 
structure” of a gene is due to “rapid, adaptive evolution”;°* it asserts that 
“positive selection has played an important role in the evolution”’*’ of genes, 
even in cases when the function of the gene under study (and thus the trait being 
selected) is completely unknown; it imagines genes being “cobbled together 
from DNA of no related function (or no function at all)”;°° it assumes the 
“creation” of new exons “from a unique noncoding genomic sequence that 
fortuitously evolved”;°° it invokes “the chimeric fusion of two genes”;?” it 
explains “near-identical”°* proteins in disparate lineages as “a striking case of 
convergent evolution”;°? and when no source material for the evolution of a new 
gene can be identified, it asserts that “genes emerge and evolve very rapidly, 
generating copies that bear little similarity to their ancestral precursors” because 
they are apparently “hypermutable.”©° Finally, when all else fails, scenarios 
invoke the “de novo origination” of new genes, as if that phrase—any more than 
the others just mentioned—constitutes a scientific demonstration of the power of 
mutational mechanisms to produce significant amounts of new genetic 
information.®*! 

These vague narratives resemble nothing so much as the naming games of 
scholastic philosophers in the Middle Ages. Why does opium put people to 
sleep? Because it has a “dormative” virtue. What causes new genes to evolve so 


rapidly? Their “hypermutability” or perhaps their ability to undergo “rapid, 
adaptive evolution.” How do we explain the origin of two similar genes in two 
separate, but otherwise widely disparate lineages? Convergent evolution, of 
course. What is convergent evolution? The presence of two similar genes in two 
separate, but otherwise widely disparate lineages. How does convergent 
evolution occur, given the improbability of finding even one functional gene in 
sequence space, let alone the same gene arising twice independently? No one 
knows exactly, but perhaps it was a “fortuitous juxtaposition of suitable 
sequences,” or “positive selection,” or “de novo origination.” Need to explain 
two similar genes in more closely related lineages? Try “gene duplication,” or 
“chimerical gene fusion,” or “retropositioning,” or “extensive refashioning of the 
genome,” or some other scientific-sounding combination of words. 

The vagueness of these scenarios raises serious questions about how 
scientists could regard them as decisive demonstrations or refutations of 
anything—let alone refutations of the kind of experimentally based, 
mathematically precise challenges to mutation and selection described in the 
previous chapter. 

So despite the official pronouncement of a federal judge and claims of 
extensive “scientific literature documenting the origin of new genes,” 
evolutionary biologists have not demonstrated how new genetic information 
arises, at least not in amounts sufficient to build protein folds, the crucial units of 
biological innovation. Biologists have not solved the problem of combinatorial 
inflation or refuted the precise quantitative argument against the creative power 
of the selection and mutation mechanism presented in the previous chapter (or in 
my 2004 article). Nor has anyone provided a compelling refutation of Douglas 
Axe’s assessment of the rarity of genes and proteins on which that argument is 
based. 

In fairness, neo-Darwinian biologists have mathematical models of their own 
—models indicating to them that nearly unlimited evolutionary change can occur 
under the right conditions. The assumption that these models, which are based 
on the equations of population genetics, accurately represent how much 
evolution can occur has left many evolutionary biologists confident in the 
creative power of various mutational mechanisms. But should they be? 

In the next chapter, I will take up this question. As I do, I’ll explain why 
evolutionary biologists have been, heretofore, untroubled by mathematical 
challenges to neo-Darwinism. I’ll also show why that has begun to change as 
new developments in molecular genetics have introduced another formidable 


mathematical challenge to the creative power of the neo-Darwinian mechanism 
—a challenge that arises from within the neo-Darwinian framework and raises 
yet new questions about the causal adequacy of the neo-Darwinian mechanism. 


12 


Complex Adaptations and the Neo-Darwinian 
Math 


University of Illinois biologist Tom Frazzetta knew the textbook story as well as 
anyone. According to neo-Darwinian theory, organisms with all their complex 
systems came into existence via natural selection acting on randomly arising, 
small-scale variations and mutations. As Frazzetta understood, this evolutionary 
mechanism necessarily transforms organisms gradually, with modifications 
parceled into increments “as a sort of continuous change, where one structural 
condition melts gradually into another.”! 

Frazzetta had his doubts, however. As an expert in functional biomechanics— 
studying how animals actually work—he had dissected the skulls of rare snakes 
found only on the island of Mauritius, in the Indian Ocean. These snakes, called 
bolyerines, are boa-like but have an anatomical specialization found in no other 
vertebrate. Their maxilla, the tooth-bearing bone of the upper jaw, is divided into 
two segments, linked by a flexible joint and serviced by many specialized 
nerves, extra bones, tissues, and differently arranged ligaments. This unique trait 
allows the snakes to bend the front half of their upper jaw backwards when they 
attack prey (see Fig. 12.1). 

Could this complex system of bones, joints, tissues, and ligaments have 
evolved gradually? “A movable joint dividing the maxilla into two segments,” 
observed Frazzetta, “seems to have either a presence or absence, with no 
intermediate to connect the two conditions.”* That is, either the maxilla occurs 
as one bone (as it does in every other vertebrate) or as two segments with all the 
accompanying joints, bones, ligaments, and tissues necessary to make it work, as 
it does in the bolyerine snakes. No intermediate condition—a broken maxilla 
with two pieces of bone lacking the necessary joints, tissues, and ligaments, for 
example—appears viable. As Stephen Jay Gould asked of the same system, 
“How can a jawbone be half broken?”? Or as Frazzetta himself observed, “I thus 


find it difficult to envision a smooth transition from a single maxilla to the 
divided condition seen in bolyerines.”* Yet because the intermediate forms 
would not be viable, building a bolyerine jaw would require all the necessary 
parts—the jointed maxilla, the adjoining ligaments, and the necessary muscles 
and tissues—arising together. 

Yet the problem for neo-Darwinian theory, Frazzetta realized, extended well 
beyond the anatomical peculiarities of rare snakes. As a young evolutionary 
biology professor, he had studied complex features in a wide variety of species. 
He knew that almost any biological structure of interest—the inner ear, the 
amniotic egg, eyes, olfactory organs, gills, lungs, feathers, the reproductive, 
circulatory, and respiratory systems—possesses multiple necessary components. 
To change such systems requires altering each of the many independent parts 
upon which their functions are based. This cannot be done willy-nilly. For 
example, changing any of the three bones of the mammalian inner ear—the 
incus, stapes, or malleus—will perforce require corresponding changes in the 
other bones and in other parts of the ear as well, such as the tympanic membrane 
or the cochlea. Complex biological systems depend for their functions on tens or 
hundreds of such independent, yet jointly necessary parts. As the number of 
necessary components increases, the requisite number of coordinated changes 
increases too, rapidly driving up the difficulty of maintaining the functional 
integrity of the system while modifying any of its parts. 






_CASAREA 





FIGURE 12.1 

A complex adaptation: the jointed upper jaw of the bolyerine snake, made 
possible by its accompanying tendons, ligaments, and musculature. The other 
skull shows the single-boned jaw, found in other related snakes. 


And that was the problem, as Frazzetta understood it. Any system that 
depends for its function on the coordinated action of many parts could not be 
changed gradually without losing function. But in the neo-Darwinian scheme of 


things, natural selection acts to preserve only functional advantages. Changes 
that result in death or reduced function will not be preserved. The integrated 
complexity of many biological systems thus imposes limitations on the 
evolutionary process—limitations that human engineers do not face when they 
design complex integrated systems. In 1975, Frazzetta wrote a minor classic 
entitled Complex Adaptations in Evolving Populations explaining this concern. 
He wrote: 


When modifying the design of a machine, an engineer is not bound by the 
need to maintain a real continuity between the first machine and the 
modification. . .. But in evolution, transitions from one type to the next 
presumably involve a greater continuity by means of a vast number of 
intermediate types. Not only must the end product—the final machine—be 
feasible, but so must be all the intermediates. The evolutionary problem is, 
in a real sense, the gradual improvement of a machine while it is 
running!° 


Historically, evolutionary biologists tried to solve this problem one 
advantageous variation or mutation at a time. Starting with Darwin himself, they 
have attempted to explain how natural selection and random variation could 
build complex systems as the result of a series of incremental changes, each of 
which might confer some selectable advantage. Darwin famously employed this 
strategy to explain the origin of the eye, asking his readers to imagine a series of 
incremental, advantageous changes to a simple “nerve sensitive to light.”° 

As Frazzetta thought about the problem of explaining the origin of complex 
systems, he came to doubt both the classical and the modern Darwinian accounts 
of such systems. Frazzetta acknowledged that he was influenced in part by the 
skepticism expressed by the Wistar “outsiders” (see Chapter 9). He admitted 
“revealing some hideous personalia” in confessing that he was attracted to the 
worries about neo-Darwinism expressed by Murray Eden and other Wistar 
skeptics.” 

Frazzetta’s concerns about the adequacy of the neo-Darwinian mechanism, 
like Eden’s, turned on the growing appreciation of the nature and importance of 
genetic information. Though biologists then (as now) didn’t fully understand 
how genetic information in DNA correlates or “maps” to these higher-level 
complex morphological structures, by 1975 they did know that many hundreds 
of genes can be involved in coding for a single complex integrated structure. 


Thus, altering the anatomical structure of the mammalian ear or the vertebrate 
eye, for example, would involve altering the genes that code for its constituents, 
which implies, most implausibly, that multiple coordinated mutations would 
occur virtually simultaneously. 

As Frazzetta explained, “Phenotypic alteration of integrated systems requires 
an improbable coincidence of genetic (and hence, heritable phenotypic) 
modifications of a tightly specified kind.”® Yet the extreme specificity of the fit 
of the components and the functional dependence of the whole system on this fit 
imply limits to allowable genetic change. Genetic change affecting any one of 
the necessary components, unless matched by many corresponding—and vastly 
improbable—genetic changes, will result in functional loss and often death. For 
this reason, as Frazzetta concluded, “We are still left with the unabating need to 
explain evolutionary changes in systems that have the operational integration 
characteristic of things we recognize as ‘machines.’ ”? At the time, the doubts he 
expressed gained little traction in the evolutionary biology community, because 
neo-Darwinian evolutionary biologists assumed that mutation and selection had 
nearly unlimited creative power, enough to generate even complex systems of 
the kind described in Frazzetta’s book. 

The mathematical expression of neo-Darwinian theory, as represented in the 
equations of a subdiscipline of biology known as population genetics, seemed to 
confirm this conviction. Population genetics models how gene frequencies 
change as the result of processes such as mutation, genetic drift (neutral changes 
in the genome that natural selection neither favors nor eliminates), and natural 
selection. On the assumption that advantageous variations or traits will arise as 
the result of even single mutations, the mathematical models of population 
genetics describe how much evolutionary change can occur in a given period of 
time. These estimates are based upon, among others, three primary factors: 
mutation rates, effective population sizes, and generation times. When 
evolutionary biologists plug estimates for these factors into the equations of 
population genetics, their calculations seem to imply that standard evolutionary 
mechanisms could generate significant amounts of evolutionary change in many 
groups of organisms—even enough to build complex systems. As long as 
mutations generate a continuous supply of new traits, any system, however 
complex, can be built one trait at a time—trait upon trait—via the creative power 
of natural selection. Or so the story goes. 

Confidence in these mathematical models (and their underlying assumptions) 
led many neo-Darwinists to disregard the need to give detailed accounts of the 


specific evolutionary pathways by which complex systems might have arisen. 
For example, in an evolutionary biology text widely used about the time 
Frazzetta first posed this challenge, evolutionary biologists Paul Ehrlich and 
Richard Holm advised: 


One need not go into the details of the evolution of the bird’s wing, the 
giraffe’s neck, the vertebrate eye, the nest building of some fish, etc., as the 
selective origins of these and other structures and of behavioral patterns 
may be assumed to be basically the same in outline as those, such as 
industrial melanism, which have already been discussed. Even a slight 
advantage or disadvantage in a particular genetic change provides a 
sufficient differential for the operation of natural selection.'° 


The phrase “sufficient differential for the operation of natural selection” refers to 
the equations of population genetics and one of the factors (the so-called 
selection coefficient) that determines how rapidly particular traits would be 
likely to disseminate through a population. The message was clear: the math tells 
the story; the biological details of the origin of complex systems don’t matter. 

The neo-Darwinian focus on mathematical modeling helps to explain why 
mainstream evolutionary biologists haven’t worried about the problem of the 
origin of new genes and proteins or the problem of combinatorial inflation, 
discussed in Chapters 9 and 10. Many contemporary evolutionary biologists, like 
the founders of population genetics, assumed that some mechanism for building 
new genes already existed. Indeed, they assumed that new traits (and the genes 
for building them) can arise as the result of even single mutations (or a series of 
such mutations that each confer a small, incremental, selectable advantage). 
Thus, the mathematical expression of neo-Darwinian theory seemed to certify 
the plausibility of even large-scale evolutionary changes—again, provided these 
changes could occur one mutation at a time. 

But what if there are systems in living organisms that cannot be built one 
mutation at a time, and instead must be built by simultaneous coordinated 
changes? What if building just a single new gene or protein requires such 
coordinated mutational changes? What if individual genes turned out to be 
complex adaptations? 

Mathematical challenges of the kind first advanced at Wistar, and that 
Douglas Axe’s experimental findings have exacerbated, initially did not dent 
confidence in the adequacy of neo-Darwinian explanations. Many evolutionary 


biologists have simply regarded mathematical challenges to the creative power 
of the mechanism, coming as they mostly do from scientists and engineers in 
other fields, as exotic or irrelevant. 

That has begun to change. And it has begun to change in a way that has not 
only introduced a new mathematical challenge to the creative power of the neo- 
Darwinian mechanism, but also in a way that indirectly confirms Axe’s key 
insight about the rarity of genes and proteins. In the last decade, developments in 
molecular genetics and population genetics have exposed a connection between 
the problem of the origin of new genes and proteins and the origin of complex 
adaptations, a connection first perceived by Tom Frazzetta back in 1975. As 
more biologists have recognized that connection, they too have begun to share 
Frazzetta’s doubt. 


POPULATION GENETICS AND THE ORIGIN OF GENETIC 
INFORMATION 


The neo-Darwinian synthesis was formulated during the 1930s before the 
elucidation of the structure of DNA. Biologists at that time did not yet 
understand the nature, structure, or precise location of genetic information." 
They did not associate genes with long strings of nucleotide bases along the 
spine of the DNA molecule. They did not think of genes as long sections of 
digital code stored in complex biomacromolecules. Instead, after Mendel, but 
before Watson and Crick, genes were defined operationally as those entities, 
associated with chromosomes, that produced specific visible or selectable 
anatomical traits, such as eye color or beak shape. 

The architects of neo-Darwinism working in the 1930s reformulated 
evolutionary theory to emphasize the importance of mutations as the source of 
genetic variation. It followed, therefore, that the mutations—which they 
regarded as the source of heritable variation—must operate on genes. Not 
knowing the nature of genes, they also assumed that a single mutation could 
alter a gene in such a way as to produce a new trait. 

The equations of population genetics are predicated upon this assumption. 
The rate of mutation thus emerges as an important factor in computing the 
amount of evolutionary change that can occur in any given population. If every 
individual mutation can produce a new, potentially selectable trait, then the rate 
at which such variation accumulates partially determines how much change can 
occur in a given time. 


After 1953, biologists no longer conceived of the gene as an abstract entity. 
Watson and Crick showed that the gene had a definite locus and structure and 
that individual genes contain hundreds or thousands of precisely sequenced 
nucleotide bases, each functioning as digital characters in a larger instruction set. 
Consequently, biologists changed their understanding of mutations as well. 
Biologists came to understand mutations as something like typographic errors in 
long strings of digital code. As a result, many scientists began to realize that 
individual mutations were unlikely by themselves to produce new beneficial 
traits. Some scientists realized that mutations were instead overwhelmingly more 
likely to degrade the information contained in a gene than to produce a new 
function or trait, and that the accumulation of mutations would eventually and 
typically result in the loss of function. 

This change of perspective called for an explanation of how mutations could 
generate new genes—an explanation that was provided beginning in the 1970s 
with the ideas of gene duplication, subsequent neutral evolution, and positive 
selection. 

Though the theory of gene duplication played no formal role in the 
mathematical structure of population genetics, it did serve to buttress a critical 
assumption of the whole enterprise. After the 1950s, evolutionary biologists no 
longer assumed that single mutations would necessarily generate whole new 
traits. That left a critical assumption of population genetics essentially 
undefended. For many evolutionary biologists, the theory of gene duplication 
closed that conceptual gap. After the theory was formulated, many evolutionary 
biologists thought that a mechanism had been discovered by which sections of 
genetic text could accumulate multiple changes without compromising the 
fitness of an organism, thus ensuring the eventual production of new genes and a 
steady supply of new traits. 

So when Frazzetta confronted evolutionary biology with the problem of 
complex adaptations in the mid-1970s, most neo-Darwinian biologists responded 
with a collective yawn, if they noticed it at all. His challenge was no challenge 
after all. So implied the math of population genetics—provided its assumptions 
about the ease with which new mutations could generate new traits were valid. 

But were they valid? Could a series of separate mutations generate the new 
genes necessary to build new proteins and new traits, or did building genes 
require multiple coordinated mutations? 


ARE GENES COMPLEX ADAPTATIONS? 


Classically, Darwinian biologists have assumed that small, separate step-by-step 
changes could produce all biological structures and features, provided each 
change confers some survival or reproductive advantage. In his chapter in the 
1909 anthology Darwin and Modern Science, the British geneticist William 
Bateson wryly described how this widespread assumption prevented 
evolutionary biologists from confronting the real difficulty of explaining the 
origin of complex adaptations: 


By suggesting that the steps through which an adaptive mechanism arises 
are indefinite and insensible, all further trouble is spared. While it could 
be said that species arise by an insensible and imperceptible process of 
variation, there was clearly no use in tiring ourselves by trying to perceive 
that process. This labour-saving counsel found great favor.' 


One of the first prominent evolutionary biologists to consider the possibility 
that building new genes and proteins might require multiple coordinated 
mutations was John Maynard Smith. Maynard Smith worked as an aeronautical 
engineer during World War II, but then took up the formal study of evolutionary 
biology after the war. He eventually helped to found the University of Sussex, 
where he also served as a distinguished professor of biology until the mid- 
1980s.19 

In 1970, Maynard Smith wrote an article in Nature responding to an earlier 
article by Frank Salisbury, a biologist from Utah State University. Salisbury had 
raised questions about whether random mutations could explain the specificity of 
the arrangement of nucleotide bases necessary to produce functional proteins. 
Salisbury worried, following discussions at Wistar, that the probability of 
random mutations generating functional arrangements of bases or amino acids 
was prohibitively low. According to Salisbury’s calculations, “The mutational 
mechanism as presently imagined could fall short by hundreds of orders of 
magnitude of producing, in a mere four billion years, even a single required 
gene.”!4 

To overcome this improbability, Maynard Smith proposed a model of protein 
evolution. While admitting that the origin of the first proteins remained a 
mystery, he suggested that one protein could evolve into another as the result of 
small incremental changes in amino-acid sequences, provided each sequence 
maintained some function at each step along the way. Maynard Smith compared 
protein-to-protein evolution to changing one letter in an English word in order to 


generate a different word (while at each step generating a different meaningful 
word). He used this example to convey how he thought protein evolution might 
work: 


WORD — WORE — GORE — GONE — GENE 
He explained: 


The words [in this analogy] represent proteins; the letters represent amino 
acids; the alteration of a single letter corresponds to the simplest 
evolutionary step, the substitution of one amino acid for another; and the 
requirement of meaning corresponds to the requirement that each unit step 
in evolution should be from one functional protein to another.!° 


As a self-professed “convinced Darwinist,” Maynard Smith realized that 
natural selection and random mutation could only build new biological structures 
from preexisting structures if each intermediate structure along the way 
conferred some adaptive advantage. He thought that this requirement applied as 
much to the evolution of new genes and proteins as it did to the evolution of new 
phenotypic traits or larger-scale anatomical structures.!° 

Nevertheless, the essentially digital or alphabetic character of the genetic 
information that directs protein synthesis suggested a problem to Maynard 
Smith. How, he asked, could one gene or protein evolve into another if such a 
transformation required multiple simultaneous changes in the bases of the 
genetic text (or arrangement of amino acids)? If building new genes required 
multiple coordinated mutations, then the probability of generating a new gene or 
protein would drop precipitously, since such a transformation would require not 
just one improbable mutational event, but two or three or more, occurring more 
or less at once. Here’s how he described the potential problem: 


Suppose that a protein ABCD . . . exists, and that a proteinabCD... 
would be favoured by selection if it arose. Suppose further that the 
intermediates aBCD .... and AbCD . . . are nonfunctional. These forms 
would arise by mutation, but would usually be eliminated by selection 
before a second mutation could occur. The double step from abCD . . . to 
ABCD would thus be very unlikely to occur.!” 


In Maynard Smith’s view, the improbability associated with “double-step” or 
multiple-step coordinated mutations presented a significant potential problem for 
molecular evolution. In the end, however, he concluded that such mutations were 
so improbable that they must not have played a significant role in the evolution 
of novel structures. As he explained, “Such double steps . . . may occasionally 
occur, but are probably too rare to be important in evolution.” !® 

For several decades, the problem he flagged receded into obscurity. As 
biochemist H. Allen Orr pointed out in 2005 in the journal Nature Reviews 
Genetics, “Although Maynard Smith’s work appeared early in the molecular 
revolution,” his ideas about problems facing protein evolution “were almost 
entirely ignored for two decades.”!9 Thus, Orr noted that evolutionary biologists 
stopped thinking about molecular evolution as a consequence of adaptive 
changes at the amino-acid level. Not until the first decade of the twenty-first 
century would biologists confront the challenge of making a rigorous 
quantitative analysis of the plausibility of protein-to-protein evolution. 


WAITING FOR COMPLEX ADAPTATIONS 


In 2004, Lehigh University biochemist Michael Behe (see Fig. 12.2), introduced 
briefly at the end of Chapter 9, and University of Pittsburgh physicist David 
Snoke published a paper in the journal Protein Science that returned to the 
problem first described by Maynard Smith.*° By this time, Behe had established 
himself as a prominent critic of neo-Darwinism by arguing that the neo- 
Darwinian mechanism did not provide an adequate explanation for the origin of 
functionally integrated “irreducibly complex” molecular machines. In his 2004 
paper, Behe sought to extend his critique of neo-Darwinism by assessing its 
adequacy as an explanation for new genes and proteins. He and Snoke attempted 
to assess the plausibility of protein evolution in the case that it does indeed 
require multiple coordinated mutations. They applied standard neo-Darwinian 
modes of analysis derived from population genetics to make their evaluation. 
They considered the plausibility of the main neo-Darwinian model of gene 
evolution in which evolutionary biologists envision new genes arising by gene 
duplication and subsequent mutations in the duplicated gene. 

Behe and Snoke assessed the plausibility of this model for multicellular 
organisms in the case that multiple (two or more) point mutations must occur 
simultaneously in order to generate a new selectable gene or protein. Whereas 
Maynard Smith saw the need for multiple coordinated mutations as a potential 
problem, one that ultimately needn’t trouble evolutionary biologists, Behe and 
Snoke argued that evolutionary biologists do need to worry about it, and they 
quantified its severity. 

Behe and Snoke first noted that many proteins, as a condition of their 
function, require unique combinations of amino acids interacting in a 
coordinated way. For example, ligand binding sites on proteins—places where 
small molecules bind to large proteins to form larger functional complexes— 
typically require a combination of several amino acids. Behe and Snoke argued 
that in such cases the combinations of amino acids would have to arise in a 
coordinated fashion since the capacity for ligand binding depends on all the 
necessary amino acids being present together. In support of this inference, they 
cited an authoritative textbook, Molecular Evolution, by University of Chicago 
evolutionary biologist Wen-Hsiung Li. In it, Li notes that evolving ligand 
binding capacity in proteins such as hemoglobin may require “many mutational 
steps,”*! even though the first steps on the way to building such capacity would 


confer no selective advantage. As Li explains, “Acquiring a new function may 
require many mutational steps, and a point that needs emphasis is that the early 
steps might have been selectively neutral [non-advantageous] because the new 
function might not be manifested until a certain number of steps had already 
occurred.”2 








FIGURE 12.2 
Michael Behe. Courtesy Laszlo Bencze. 


Behe and Snoke point out that this observation implies that a series of 
separate mutations could not generate a ligand binding function in a protein that 
previously did not have this capacity, since individual amino-acid changes would 
initially confer no selectable advantage on the protein lacking this function. 
Instead, evolving ligand binding capability would require multiple coordinated 
mutations. Behe and Snoke make a similar argument about the requirements for 
the evolution of protein-to-protein interactions. They note that for proteins to 
interact with each other in specific ways, typically at least several individually 
necessary amino acids must be present in combination in each protein, again, 


suggesting the need for multiple coordinated mutations. 


SO MANY CHANGES, SO LITTLE TIME 


Behe and Snoke used the principles of population genetics to assess the 
likelihood of various numbers of coordinated mutational changes occurring in a 
given period of time. They asked: Is it probable that there was enough time in 
evolutionary history to generate coordinated mutations? If so, how many 
coordinated mutations is it reasonable to expect in a period of time given various 
population sizes, mutation rates, and generation times? Then, for different 
combinations of these various factors, they assessed how long it would typically 
take to generate two or three or more coordinated mutations. They determined 
that generally the probability of multiple mutations arising in close (functionally 
relevant) coordination to each other was “prohibitively” low—it would likely 
take an immensely long time, typically far longer than the age of the earth. 


THE POWERBALL LOTTERY—POPULATION GENETICS MADE 
EASY 


Before going on, it might be helpful to understand a bit more about how the 
equations and principles of population genetics can be used to calculate what 
evolutionary biologists call “waiting times,” the expected time that it will take 
for a given trait to arise by various evolutionary processes. In his book The Edge 
of Evolution, Michael Behe illustrates these principles using a charming analogy 
to the Powerball lottery game that many American state governments use to raise 
money. 

To win at Powerball, contestants must purchase tickets with six numbers that 
match the numbers printed on six balls drawn from two drums. Five of the balls 
are selected from a drum containing 59 white balls, numbered 1 through 59. A 
sixth red ball, the so-called power ball, is chosen out of a drum of 35 red balls 
numbered 1 to 35. To win the jackpot—which can exceed $100 million—a 
player must purchase a ticket listing all six of the chosen numbers in any order. 
The Powerball website lists the probability of matching all six balls at roughly 1 
in 175 million. Depending on how many tickets have been purchased and how 
frequently drawings occur, it may take a very long time for someone to win. 

Behe asked his readers first to consider how long it will take, on average, to 
generate a lottery ticket with the winning numbers. He notes that knowing the 


probability of drawing such a winning ticket isn’t sufficient. The calculation also 
requires knowing how often drawings occur and how many tickets are sold. As 
Behe explains: “If the odds of winning are one in a hundred million, and if a 
million people play every time, then it will take on average about a hundred 
drawings for someone to win.” If there are about a hundred drawings per year, 
with a million people playing per drawing, “then it would take about a year 
before someone won. But if there were only one drawing per year, on average it 
would take a century to hit the jackpot.”*? More frequent drawings produce 
shorter waiting times. Less frequent drawings tend to require longer waiting 
times. Similarly, more players will decrease the average time necessary to 
produce a winner, while fewer players result in longer waits. 

Similar mathematical principles apply when calculating the expected waiting 
times for the evolution of biological features by mutation and selection. 
Biologists first need to assess the complexity of the system—or its inverse, the 
improbability of the feature occurring. As in Powerball, however, knowing the 
probability of an event by itself does not allow someone to calculate how long it 
will likely take for that event to occur. Such a calculation requires also knowing 
the size of the population (equivalent to how many people are playing 
Powerball) and how frequently new genetic sequences arise (equivalent to how 
frequently drawings are held). 

In Powerball, a new sequence of numbers arises in every drawing. But when 
organisms reproduce, they do not always generate a new sequence of nucleotide 
bases in their individual genes. For this reason, to calculate the rate at which new 
sequences arise in living organisms requires knowing two factors: the generation 
time and the mutation rate. More rapid rates of mutation and/or shorter 
generation times will increase the rate at which new genetic sequences arise, 
resulting in shorter waiting times. Slower rates of mutation and/or longer times 
between generations produce longer waiting times. Also, as in Powerball, the 
number of “players” is important. Larger populations generate new genetic 
sequences more frequently than smaller ones and, thus, decrease expected 
waiting times. Smaller populations reduce the rate at which new sequences are 
generated, increasing waiting times. 

Now, under the rules of Powerball, you can “win” without picking all six 
numbers correctly—you just won’t win the entire jackpot.** If you pick just the 
red “power ball” correctly, you win $4. Pick three white balls correctly, and you 
win $7. If you correctly pick the numbers of four white balls, you win $100. 
Guess all five white balls correctly (but not the red ball), and you can win a cool 


$1 million. 

With each additional ball necessary to secure a new level of winnings, the 
probability of winning decreases exponentially, while the values of the prizes 
increase dramatically. The Powerball website lists the probability of winning a 
$4 prize at just 1 in 55, the probability of winning $1 million dollars at roughly 1 
in 5 million, and the probability of winning the jackpot as 1 in 175 million (see 
Fig. 12.3).?° 




















NUMBER OF BALLS 
WHICH MUST BE ODDS OF WINNING PRIZE 
GUESSED CORRECTLY 
1 (RED) 1IN 55.41 $4 
3 (ALL WHITE) 1IN 360.14 $7 
4 (ALL WHITE) 1 IN 19,087.53 $100 
5 (4 WHITE, 1 RED) 1IN 648,975.96 $10,000 
5 (ALL WHITE) 1 IN §,153,632.65 $1,000,000 
FIGURE 12.3 


A chart showing the probability of winning and the corresponding payouts for 
different combinations of balls in the Powerball lottery game. 


Neo-Darwinists have long assumed that biological evolution works 
something like matching one number in Powerball. In their view, natural 
selection acts to reward or preserve small but relatively probable changes in gene 
sequences—like winning the small but more likely $4 prize in Powerball over 
and over again. They assume the mutation and selection mechanism doesn’t 
depend on winning extremely unlikely “prizes” (like the whole Powerball 
jackpot) all at once. 

But what if, to produce a functional advantage at the genetic level, the 
mutation and selection mechanism had to generate the biological equivalent of 
all six (or more) correct balls in the Powerball lottery with no reward for 
guessing a smaller number of balls correctly first? Clearly, the probability of this 
would be extremely small. And the waiting time for winning such a lottery could 
become prohibitively long. 


BACK TO THE BIOLOGY 


That brings us back to Behe and Snoke’s conclusion. In their 2004 paper, they 
argued that generating a single new protein will often require many improbable 
mutations occurring at once. They took into account the improbability of 
multiple functionally necessary mutations appearing together—the equivalent of 
needing to get a Powerball ticket matching several numbers to win any money at 
all. Then they sought to determine how long it would take and/or how large the 
population sizes would need to be to generate a new gene via multiple 
coordinated mutational changes—the genetic equivalent of the “jackpot 
scenario.” 

Behe and Snoke found that if generating a new gene required multiple 
coordinated mutations, then the waiting time would grow exponentially with 
each additional necessary mutational change. They also assessed how population 
sizes affected how long it would take to generate new genes, if multiple 
coordinated mutations were necessary to produce those genes. They found, not 
surprisingly, that just as larger populations diminished expected waiting times, 
smaller populations dramatically increased them. 

More important, they found that even if building a new gene required just two 
coordinated mutations, the neo-Darwinian mechanism would likely either 
require huge population sizes or extremely long waiting times or both. If 
coordinated mutations were necessary, then evolution at the genetic level faced a 
catch-22: for the standard neo-Darwinian mechanism to generate just two 
coordinated mutations, it typically either needed unreasonably long waiting 
times, times that exceeded the duration of life on earth, or it needed 
unreasonably large population sizes, populations exceeding the number of 
multicellular organisms that have ever lived. To get population sizes that were 
reasonable, they had to have waiting times that were unreasonable. To get 
waiting times that were reasonable, they had to have population sizes that were 
unreasonable. As they put it, either way the “numbers seem prohibitive.”7© 

Behe and Snoke found that mutation and selection could generate two 
coordinated mutations in a mere 1 million generations, a reasonable length of 
time given the age of the earth. But that was only in a population of 1 trillion or 
more multicellular organisms, a number that exceeds the size of the effective 
breeding populations of practically all individual animal species that have lived 
at any given time.*’ Conversely, they found that mutation and selection could 


generate two coordinated mutations in a population of only 1 million organisms, 
but only if the mechanism had 10 billion generations at its disposal. Yet on the 
assumption that each multicellular organism lived only one year, 10 billion 
generations computes to 10 billion years—more than twice the age of the earth. 
This is clearly an unreasonable length of time to wait for the emergence of a 
single gene, let alone more significant evolutionary innovations. 

Behe and Snoke did, however, find one tiny “sweet spot” in which a gene 
requiring only two coordinated mutations could arise (see Fig. 12.4). Such a 
gene could conceivably arise from 1 billion organisms in a “mere” 100 million 
generations. Since many more than 1 billion multicellular organisms have lived 
on earth during its history and since multicellular life on earth has existed for 
more than 500 million years, these numbers offer (assuming, again, one year per 
generation) the prospect of enough time and organisms to generate one new gene 
—if only two coordinated mutations are necessary. (Of course, if the population 
evolving a two-mutation trait had fewer than 1 billion organisms, then the 
waiting times again increased to unreasonable lengths.) 
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FIGURE 12.4 

This diagram shows the population sizes and times (measured in number of 
generations) required to produce a gene or trait if building that gene or trait 
requires multiple coordinated mutations. The shaded gray area shows the “sweet 
spot”—population sizes and available time sufficient to generate the coordinated 
mutations necessary to produce a new gene. Note that any multi-mutation feature 
requiring more than two mutations could not, in all probability, evolve by gene 
duplication and subsequent coordinated mutation in a population of multicellular 


organisms, however large. Note also that for most normal population sizes and 
reasonable generation times, even evolving two mutations lies beyond the reach 
of gene duplication, mutation, and selection. Courtesy John Wiley and Sons and 
The Protein Society. 


Nevertheless, these numbers only apply to the case in which only two 
coordinated mutations are necessary to build a new gene. Behe and Snoke found 
that if generating a new functional gene or trait required more than two 
coordinated mutations, then excessively long waiting times were necessary 
regardless of the size of the population. If three or more coordinated mutations 
were necessary, their calculations generated no “sweet spots” at all. Thus, they 
concluded that “the mechanism of gene duplication and point mutation alone 
would be ineffective, at least for multicellular . . . species.”78 

In summary, Behe and Snoke applied the principles derived from population 
genetics to evaluate the creative power of the standard neo-Darwinian model of 
gene evolution. They showed that the standard model encounters clear 
probabilistic limits if the structures it needs to build require more than two 
coordinated mutations in multicellular eukaryotic organisms. 


THE EDGE OF EVOLUTION AND ITS CRITICS 


Behe and Snoke are well-known critics of the creative power of the neo- 
Darwinian mechanism, so their conclusion might seem suspect to some 
observers. Nevertheless, evolutionary biologists attempting to defend the 
creative power of the neo-Darwinian mechanism have inadvertently confirmed 
Behe and Snoke’s conclusions. 

Two recent scientific publications tell this story. First, in 2007, Michael Behe 
published a book, The Edge of Evolution, amplifying the results of his 2004 
paper with David Snoke. Using public-health data about a genetic trait— 
resistance to the antimalarial drug chloroquine in the one-celled organism that 
causes malaria—Behe provided another line of evidence and argument to 
support the conclusion that multiple coordinated mutations are often necessary 
to produce even minor genetic adaptations. 

Based on public-health data, Behe determined that resistance to chloroquine 
only arises once in every 10°? malaria-causing cells. Behe inferred, by working 
the problem backwards, that the trait probably required multiple—though not 
necessarily coordinated—mutations to develop. He called this trait a 


“chloroquine complexity cluster,” or a “CCC.”29 Behe wanted to explore what 
he called the “edge of evolution,” the limits to the creative power of mutation 
and selection at the genetic level. Having established that this trait could arise by 
random mutation in a reasonably short period of time, he wondered how much 
time would be required to produce traits of greater complexity in populations of 
various sizes. 

He asked his readers to consider a hypothetical genetic trait twice as complex 
as a CCC cluster—a feature requiring the origin of two coordinated traits, each 
as complex as a CCC. In other words, Behe wondered how long it would take to 
develop a hypothetical trait that required two genetic changes as complex as a 
chloroquine complexity cluster, if both changes had to occur together in the same 
organism—in a coordinated fashion—in order to produce the trait. He then 
showed, using the principles of population genetics, that multi-mutation traits of 
that complexity—the molecular equivalent of two coordinated CCCs—would 
require many more organisms or vastly more time than was reasonable given the 
history of life. Remember the Powerball lottery: waiting times increase 
exponentially with each additional coordinated change or winning element 
needed. Behe showed, for example, that if 10° organisms were required to 
obtain one CCC, then the square of that amount—10*? organisms—would be 
required to evolve a trait that required two coordinated CCCs before providing 
any advantage.°° But, as we saw in Chapter 10, only 10%? total organisms have 
ever existed on earth, implying that the entire history of the earth would barely 
provide enough opportunities to generate a trait of this complexity.*! 

Similarly, Behe reasoned that for organisms in smaller population sizes, 
developing a trait of twice the complexity of a CCC would require immensely 
long waiting times. He also determined that exceedingly long waiting times are 
typically required to generate even less complex genetic adaptations in smaller 
populations. 

Behe showed that the problem of coordinated mutations was particularly 
acute for longer-lived organisms with small population sizes—organisms such as 
mammals or, more specifically, human beings and their presumed prehuman 
ancestors. Behe estimated, based upon relevant mutation rates, known human 
population sizes, and generation times, the time required for two coordinated 
mutations to occur in the hominid line. He calculated that producing even such a 
modest evolutionary change would require many hundreds of millions of years. 
Yet, humans and chimps are thought to have diverged from a common ancestor 
only 6 million years ago. Behe’s calculation implied that the neo-Darwinian 


mechanism does not have the capacity to generate even two coordinated 
mutations in the time available for human evolution—and thus does not explain 
how humans arose. 

Here the story gets really interesting. Soon after the publication of The Edge 
of Evolution, two Cornell University mathematical biologists, Rick Durrett and 
Deena Schmidt, both defenders of neo-Darwinism, attempted to refute Behe’s 
conclusion by making their own calculations. Their paper, “Waiting for Two 
Mutations: With Applications to Regulatory Sequence Evolution and the Limits 
of Darwinian Evolution,” also applied a model based upon population genetics 
to calculate the amount of time necessary to generate two coordinated mutations 
in the hominid line. Although they calculated a shorter waiting time then Behe 
did, their result nevertheless underscored the implausibility of relying on the 
neo-Darwinian mechanism to generate coordinated mutations during the relevant 
evolutionary timescale. Their calculation suggested that it would take not several 
hundred million years, but “only” 216 million years to generate and fix two 
coordinated mutations in the hominid line—more than thirty times the amount of 
time available to produce humans and chimps and all their distinctive complex 
adaptations and differences from their inferred common ancestor. 

In seeking to refute Behe, Durrett and Schmidt inadvertently confirmed his 
main contention. As they acknowledged, their calculation implies that generating 
two or more coordinated mutations is “very unlikely to occur on a reasonable 
timescale.”** In sum, calculations performed by both critics and defenders of 
neo-Darwinian evolution now reinforce the same conclusion: if coordinated 
mutations are necessary to generate new genes and proteins, then the neo- 
Darwinian math itself, as expressed in the principles of population genetics, 
establishes the implausibility of the neo-Darwinian mechanism. 


TESTING THE CO-OPTION OPTION 


But does generating novel genes and proteins require coordinated mutations? 
Behe and Snoke inferred as much based upon an undisputed fact of molecular 
biology: many proteins rely on sets of amino acids acting in close coordination 
in order to perform their functions. In addition, in The Edge of Evolution, Behe 
argued on functional grounds that many complex biological systems would 
require coordinated adaptive mutations since in these systems, the absence of 
even one or a few gene products (proteins or traits) will cause them to lose 
function. Behe specifically showed that several molecular machines within cells 


(such as the cilium and intraflagellar transport system, and the bacterial flagellar 
motor”’) require the coordinated interaction of multiple protein parts in order to 
maintain their function. Nevertheless, in making this argument, Behe did not 
address an alternative idea about the pathway by which new genes and proteins 
might have evolved, and thus did not establish conclusively that new genes and 
proteins themselves represent complex adaptations. 

Some neo-Darwinists have proposed a model of protein evolution known as 
“co-option.” In this model, a protein that performs one function is transformed, 
or “co-opted” to perform some other function. This model envisions new 
features requiring multiple “mutations” arising in a step-by-step way to produce 
some protein, call it “Protein B,” from some other protein that lacked those 
features, call it “Protein A.” In proposing a series of single separate mutations, 
advocates of co-option acknowledge that the initial individual amino-acid 
changes, the first few steps in evolution, from Protein A, the protein lacking the 
multisite feature, would not allow Protein A to perform the function of Protein 
B. Nevertheless, they propose that these initial changes might have allowed 
Protein A to perform some other advantageous function, thus making it 
selectable and preventing protein evolution from terminating due to diminution 
or loss of its initial function. Eventually, as mutations continued to generate new 
proteins with slightly different functions, they would have generated a protein 
close enough in sequence and structure that just one or a very few additional 
changes would suffice to convert it into Protein B. 

Aware of these imaginative scenarios, Douglas Axe and his colleague, 
molecular biologist Ann Gauger (see Fig. 12.5), now working together at the 
Biologic Institute in Seattle, decided to put them to an ingenious experimental 
test.°* In so doing, they sought to determine whether the evolution of new 
multisite features does indeed typically require multiple coordinated mutations, 
or instead whether such a feature could arise by co-option. 








FIGURE 12.5 
Ann Gauger. Courtesy Laszlo Bencze. 


Axe and Gauger scoured protein databases looking for proteins that are as 
similar as possible in sequence and structure, but that nevertheless perform 
different functions. They identified two proteins that meet those criteria. One of 
these proteins (Kbl,) is needed for breaking down an amino acid called 


threonine, and the other (BioF,) is needed for building a vitamin called biotin. 


(See Fig. 12.6.) 
Gauger and Axe realized that if they could transform Kbl, into a protein 


performing the function of BioF, with just one or very few coordinated amino- 


acid changes, then that might demonstrate (depending upon how few) that the 
two proteins were close enough in sequence that a conversion in function of the 
kind envisioned by co-option advocates is plausible in evolutionary time. What’s 
more, because they knew the difficulty scientists have had in showing any real 
change of protein function to be feasible, a positive result would suggest that 
they had at last discovered a functional gap that one or very few mutations could 
plausibly jump—as co-option envisioned. 

If, however, they found that many coordinated mutational changes were 


needed, then that could establish—depending upon how many were needed— 
that the Darwinian mechanism could not accomplish the functional jump from A 
to B in a reasonable time. That would imply that an even greater degree of 
structural similarity between proteins would be needed for the co-option 
hypothesis to be plausible. Having carefully examined the structural similarities 
between members of a large class of structurally similar enzymes, they knew that 
Kbl, and BioF, were about as close in sequence and structure as any two known 


proteins that performed different functions. Thus, if it turned out that converting 
one protein function into the other required many coordinated mutations—more 
than could be expected to occur in a reasonable time—then the outcome of their 
experiment would have devastating implications for standard accounts of protein 
evolution. If proteins that perform two different functions have to be even more 
similar than Kbl, and BioF, in order for mutational changes to convert the 


function of one to the other, then for all practical purposes co-option would not 
work. There simply aren’t many known jumps that small. 








FIGURE 12.6 
The proteins, Kbl, (left) and BioF, (right) are enzymes that use similar catalytic 


mechanisms to accelerate different chemical reactions in the bacterium FE. coli. 
Courtesy Ann Gauger and Douglas Axe. 


Axe and Gauger first identified those amino-acid sites that were most likely, 
if mutated, to cause a change from Kbl, function to BioF, function. They then 
systematically mutated those sites individually and in groups involving various 
amino-acid combinations. Their results were unambiguous. They found that they 
could not induce, with either one or a small number of amino acids, the change 
in function they sought. In fact, they found that they could not get Kbl, to 
perform the function of BioF,, even if they mutated larger numbers of amino 
acids in concert—that is, even if they made many more coordinated mutations 


than could plausibly occur by chance in all of evolutionary history. 
Although their attempts to convert Kbl, to perform the function of BioF, 


failed, their experiment did not. It allowed them to establish experimentally for 
the first time that the co-option hypothesis of protein evolution lacks credibility 
—simply too many coordinated mutations would be required to convert one 
protein function to another, even in the case of extremely similar proteins. That 
implied that generating new genes and proteins would require multiple 
coordinated mutations, and thus, the waiting times that Behe and Snoke had 
calculated do present a problem for neo-Darwinian theory. 

The experimental work also enabled Axe to calculate expected waiting times 
for various numbers of coordinated mutations given different variables and 
factors. Axe developed a refined population-genetics mathematical model to 
calculate various waiting times. His results roughly confirmed the previous 
calculations of Behe and Snoke. He found, for example, that if he took into 
account the probable fitness cost to an organism of carrying unnecessary gene 
duplicates (as was necessary to give the evolution of a new gene a reasonable 
chance), that the probable waiting time for even three coordinated mutations 
exceeded the duration of life on earth. 

He therefore effectively determined an upper bound of two for the number of 
coordinated mutations that could be expected to occur in a duplicate gene during 
the history of life on earth (taking into account the negative effects of carrying 
gene duplicates in the evolutionary process). He also calculated six coordinated 
mutations as an upper bound, neglecting the fitness cost of carrying gene 
duplicates. Nevertheless, in their experiments, he and Gauger could not induce a 
functional change in a single gene with more than six coordinated mutations. So, 
even that more generous—and, again, unrealistically generous upper bound— 
does little to render the co-option hypothesis credible. Indeed, Axe and Gauger’s 
experiments showed that the smallest realistically conceivable step exceeded 
what was plausible given the time available to the evolutionary process. In their 
words, “evolutionary innovations requiring that many changes . . . would be 
extraordinarily rare, becoming probable only on timescales much longer than the 
age of life on earth.” 


WHAT IT ALL MEANS 


By showing the implausibility of the co-option model of protein evolution and 
the need for multiple coordinated mutations in order to generate multisite 
features in proteins, Axe and Gauger confirmed that genes and proteins 
themselves represent complex adaptations—entities that depend upon the 
coordinated interaction of multiple subunits that must arise as a group to confer 
any functional advantage. 

The need for coordinated mutations means that evolutionary biologists cannot 
just assume that mutations will readily generate new genes and traits, as neo- 
Darwinists have long presupposed. Indeed, by applying mathematical models 
based on the standard principles of population genetics to the questions of the 
origin of genes themselves, Behe and Snoke, Durrett and Schmidt 
(inadvertently), Axe and Gauger, and other biologists*’ have recently shown that 
generating the number of multiple coordinated mutations needed to produce 
even one new gene or protein is unlikely to occur within a realistic waiting time. 
Thus, these biologists establish the implausibility of the neo-Darwinian 
mechanism as a means of generating new genetic information. 

There is one other aspect of this. The body of work published between 2004 
and 2011 also provides additional confirmation of Axe’s research showing the 
rarity of genes and proteins in sequence space. In fact, that research helps to 
explain why such long waiting times are necessary. If functional sequences are 
rare in sequence space, it stands to reason that finding them by purely random 
and undirected means will take a long time. Moreover, waiting times increase 
exponentially with each additional necessary mutation. Thus, long waiting times 
for the production of new functional genes and proteins is exactly what we 
should expect if indeed functional genes and proteins are rare, and if coordinated 
mutations are necessary to produce them. Thus, the various experiments and 
calculations performed between 2004 and 2011 indirectly confirm Axe’s earlier 
conclusion about the rarity of functional genes and proteins and supply further 
evidence that the neo-Darwinian mechanism cannot generate the information 
necessary to build new genes, let alone a new form of animal life, in the time 
available to the evolutionary process. 


THE MATH AND THE MECHANISM 


There is a concluding irony in all this. The researchers calculating waiting times 
for the appearance of complex adaptations have in each case done so using 
models based on the core principles of population genetics, the mathematical 
expression of neo-Darwinian theory. In a real sense, therefore, the neo- 
Darwinian math is itself showing that the neo-Darwinian mechanism cannot 
build complex adaptations—including the new information-rich genes and 
proteins that would have been necessary to build the Cambrian animals. To adapt 
a metaphor that Tom Frazzetta might appreciate, the snake has eaten its own tail. 


13 
The Origin of Body Plans 


Rarely have the implications of a Nobel Prize—winning scientific discovery 
received so little notice. Of course, the discovery itself received great acclaim. 
But the deeper meaning was another matter. 

Starting in the autumn of 1979, at the European Molecular Biology 
Laboratory in Heidelberg, two venturesome young geneticists, Christiane 
Niisslein-Volhard and Eric Wieschaus (see Fig. 13.1), generated thousands of 
mutations to investigate the genomes of tens of thousands of fruit flies (species: 
Drosophila melanogaster). They hoped to get them to divulge the secrets of 
embryological development. In technical jargon, Niisslein-Volhard and 
Wieschaus performed “saturation mutagenesis” experiments. After feeding male 
flies the potent mutation-causing chemical (i.e., mutagen) ethyl methane 
sulphonate (EMS), Ntisslein-Volhard and Wieschaus bred the males with virgin 
females. They then examined the offspring larvae for visible defects. 

In generating many thousands of mutants, thereby “saturating” the 
Drosophila genome, Niisslein-Volhard and Wieschaus induced variations in the 
small subset of genes that specifically regulate embryonic development. These 
regulatory genes normally control the expression of many other genes that build 
the fly embryo, progressively subdividing it into regions that will become the 
head, thorax, and abdomen of the adult fly. The EMS mutagen disrupts DNA 
replication, thereby mutating genes. These mutations affect the process of 
development, leaving visible defects in the fly larvae. By observing the damaged 
larvae, Wieschaus and Niisslein-Volhard inferred how specific genes regulate the 
development of different parts of the fly body plan. In essence, Wieschaus and 
Niisslein-Volhard reverse-engineered the fly’s genome to determine the function 
of the different genes, including the regulatory genes crucial to fly 
development.! 

The thoroughness and novelty of the “Heidelberg screens” (as the 


experiments came to be known) and their importance for revealing the 
mechanisms of regulatory control during animal embryogenesis won the 
attention of the Nobel committee. In 1995, the committee awarded the Nobel 
Prize in Medicine or Physiology to Niisslein-Volhard and Wieschaus. “This work 
was revolutionary,” University of Cambridge geneticist Daniel St. Johnston 
explained, “because it was the first mutagenesis in any multicellular organism 
that attempted to find most or all of the mutations that affect . . . the essential 
patterning genes that are used throughout development.” 

That’s the story as it is usually told. And it’s correct, as far as it goes. But the 
mutant fruit flies obtained by Niisslein-Volhard and Wieschaus tell another story 
—one less widely known, but one containing important clues for the unsolved 
mystery of the origin of animal body plans. 

Wieschaus himself alluded to these clues in a memorable interaction at the 
1982 meeting of the American Association for the Advancement of Science 
(AAAS). After a session on the processes of macroevolution in which Wieschaus 
had presented a paper, one audience member asked him what he meant by the 
term “strong” as he used it to describe the mutations he and Niisslein-Volhard 
had induced in flies. Wieschaus explained with a laugh that “strong” certainly 
did not mean alive. Without exception, the mutants he studied perished as 
deformed larvae long before achieving reproductive age. “No, dead is dead,” he 
joked, “and you can’t be more dead.”° 

Another questioner then asked Wieschaus about the implications of his 
findings for evolutionary theory. Here Wieschaus responded more soberly, 
wondering aloud about whether his collection of mutants offered any insights 
into how the evolutionary process could have constructed novel body plans. 
“The problem is, we think we’ve hit all the genes required to specify the body 
plan of Drosophila,” he said, “and yet these results are obviously not promising 
as raw materials for macroevolution. The next question then, I guess, is what are 
—or what would be—the right mutations for major evolutionary change? And 
we don’t know the answer to that.”4 

Thirty years later, developmental and evolutionary biologists still don’t know 
the answer to that question. At the same time, mutagenesis experiments—on 
fruit flies as well as on other organisms such as nematodes (roundworms), mice, 
frogs, and sea urchins—have raised troubling questions about the role of 
mutations in the origin of animal body plans. If mutating the genes that regulate 
body-plan construction destroy animal forms as they develop from an embryonic 
state, then how do mutations and selection build animal body plans in the first 


place? 

The neo-Darwinian mechanism has failed to explain the generation of new 
genes and proteins needed for building the new animal forms that arose in the 
Cambrian explosion. But even if mutation and selection could generate 
fundamentally new genes and proteins, a more formidable problem remains. To 
build a new animal and establish its body plan, proteins need to be organized 
into higher-level structures. In other words, once new proteins arise, something 
must arrange them to play their parts in distinctive cell types. These distinctive 
cell types must, in turn, be organized to form distinctive tissues, organs, and 
body plans. This process of organization occurs during embryological 
development. Thus, to explain how animals are actually built from smaller 
protein components, scientists must understand the process of embryological 
development. 
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FIGURE 13.1 
Figure 13.1a (left): Christiane Nisslein-Volhard. Courtesy Wikimedia Commons, user Rama. Figure 13.1b 
(right): Eric F. Wieschaus. 


THE ROLE OF GENES AND PROTEINS IN ANIMAL DEVELOPMENT 


As much as any other subdiscipline of biology, developmental biology has raised 
disquieting questions for neo-Darwinism. Developmental biology describes the 
processes, called ontogeny, by which embryos develop into mature organisms. 
Within the past three decades the field has dramatically advanced our 
understanding of how body plans arise during ontogeny. Much of this new 
knowledge has come from studying so-called model systems—organisms that 
biologists can easily mutate in the lab, such as the fruit fly Drosophila and the 
nematode Caenorhabditis elegans. 

Although the exact details of animal development can vary in bewildering 
ways depending on the species, all animal development exemplifies a common 


imperative: start with one cell, end with many different cells. In most animal 
species, development begins with the fertilized egg. Once the egg divides into its 
daughter cells, becoming an embryo, the organism begins heading toward a well- 
defined target, namely, an adult form that can reproduce. Arriving at that distant 
target requires the embryo to produce many specialized cell types, in the correct 
positions and at the right time. 

Cell differentiation involves coordinating the expression of specific genes in 
space and time, as the number of cells, taking on their different roles, rises from 
one to two to four to eight, doubling and doubling until it reaches thousands, 
millions, and even trillions, depending on the species. The number of cell 
divisions and the total number of cells reflects the number of different cell types 
the adult needs. This in turn requires producing different proteins for different 
cell types. 

For example, the specialized digestive proteins that service the cells lining the 
adult intestine differ from proteins expressed in a neuron found in the nerve tract 
of a limb. They must differ because each performs dramatically different 
functions. So, during development, the appropriate genes must be turned on, or 
“up-regulated,” and turned off, or “down-regulated,” to ensure the production of 
the correct protein products at the right time and in the right cell types. 

Specific proteins play active roles in regulating the expression of genes for 
building other proteins. The protein actors playing these coordinating roles are 
known as transcriptional regulators (TRs) or transcription factors (TFs). TRs (or 
TFs) usually bind directly to specific sites in DNA, either preventing 
(repressing) or enabling (activating) the transcription of specific genes into 
RNA. TRs or TFs convey instructions about which genes to turn on or turn off. 
Their three-dimensional geometries exhibit characteristic DNA-binding features, 
including a specific domain of 61 amino acids that wraps around the DNA 
double helix. Other transcription factors include the zinc finger and leucine 
zipper motifs that also bind to DNA. Transcriptional regulators and factors are 
themselves controlled by complex circuits and signals transmitted by other genes 
and proteins, the overall complexity and precision of which is breathtaking. 

Painstaking genetic research—performed by Niisslein-Volhard and 
Wieschaus and many other developmental biologists°>—has uncovered many of 
the key embryonic regulatory genes that help switch cells into their differentiated 
adult types. This research also uncovered a profound difficulty cutting to the 
very core of the neo-Darwinian view of life. 


EARLY-ACTING BODY-PLAN MUTATIONS AND EMBRYONIC 
LETHALS 


To create significant changes in the forms of animals requires attention to timing. 
Mutations in genes expressed late in the development of an animal will affect 
relatively few cells and architectural features. That’s because by late in 
development the basic outlines of the body plan have already been established.° 
Late-acting mutations therefore cannot cause any significant or heritable changes 
in the form or body plan of the whole animal. Mutations that are expressed early 
in development, however, may affect many cells and could conceivably produce 
significant changes in the form or body plan, especially if these changes occur in 
key regulatory genes.’ Thus, mutations that are expressed early in the 
development of animals have probably the only realistic chance of producing 
large-scale macroevolutionary change.® As evolutionary geneticists Bernard 
John and George Miklos explain, “macroevolutionary change” requires changes 
in “very early embryogenesis.”? Former Yale University evolutionary biologist 
Keith Thomson concurs: only mutations expressed early in the development of 
organisms can produce large-scale macroevolutionary change.!° 

Yet from the first experiments by geneticist T. H. Morgan systematically 
mutating fruit flies early in the twentieth century until today, as many model 
species have been subjected to mutagenesis, developmental biology has shown 
that mutations affecting body-plan formation expressed early in development 
inevitably damage the organism.''! (See Fig. 13.2, for examples.) As one of the 
founders of neo-Darwinism geneticist R. A. Fisher noted, such mutations are 
“either definitely pathological (most often lethal) in their effects,” or they result 
in an organism that cannot survive “in the wild state.” ! 

Normal development in any animal can be represented as an expanding 
network of decisions, where the earliest (upstream) decisions have greater 
impact than those occurring later. Regulatory genes and their DNA-binding 
protein products help to control this unfolding network of decisions—such that if 
regulatory proteins are altered or destroyed by mutation, the effects cascade 
downstream into the whole developmental process. The earlier the failure, the 
more widespread the destruction. Geneticist Bruce Wallace explains why early- 
acting mutations are thus overwhelmingly likely to disrupt animal development. 
“The extreme difficulty encountered,” he observes, “when attempting to 
transform one organism into another . . . still functional one lies in the difficulty 
in resetting a number of the many controlling switches in a manner that still 


allows for the individual’s orderly (somatic) development.” !? 


Niisslein-Volhard and Wieschaus discovered this problem in experiments 
performed on fruit flies after their first Nobel Prize-winning efforts. In these 
later experiments they studied protein molecules that influence the organization 
of different types of cells early in the process of embryological development. 
These molecules, called “morphogens,” including one called Bicoid, are critical 
to establishing the fruit fly’s head-to-tail axis. They found that when these early- 
acting, body-plan-affecting molecules are perturbed, development shuts down. 
When mutations occur in the gene that codes for Bicoid, the resulting embryos 
die!4—as they do in all other known cases in which mutations occur early in the 
regulatory genes that affect body-plan formation. 

There are good functional reasons for this, familiar to us from the logic of 
other complex systems. If an automaker modifies a car’s paint color or seat 
covers, nothing else needs to be altered for the car to operate, because the normal 
function of the car does not depend upon these features. But if an engineer 
changes the length of the piston rods in the car’s engine, and does not modify the 
crankshaft accordingly, the engine won’t run. Similarly, animal development is a 
tightly integrated process in which various proteins and cell structures depend 
upon each other for their function, and later events depend crucially on earlier 
events. As a result, one change early in the development of an animal will 
require a host of other coordinated changes in separate but functionally 
interrelated developmental processes and entities downstream.!° This tight 
functional integration helps explain why mutations early in development 
inevitably result in embryonic death and why even mutations expressed 
somewhat later in development commonly leave organisms crippled. 

Looking more closely at a specific experimental result of this kind further 
illuminates the problem. A mutation in the regulatory Ultrabithorax gene 
(expressed midway in the development of a fly) produces an extra pair of wings 
on a normally two-winged creature. Although an extra set of wings may sound 
like a useful piece of equipment, it’s not at all. This “innovation” results in a 
crippled insect that cannot fly because it lacks, among other things, a 
musculature to support the use of its new wings. Because the developmental 
mutation was not accompanied by the many other coordinated developmental 
changes that were needed to make the wings useful, the mutation is decidedly 
harmful. 





THE "SHORT WINGS" MUTANT THE "CURLY WINGS" MUTANT 
CANNCT FLY CANNOT FLY 








THE "EYELES" MUTANT THE "ANTENNAPEDIA" MUTANTS 
I$ COMPLETELY BLIND HAVE LEGS WHERE THEIR 
ANTENNAE SHOULD BE AND 
CANNCT REPRODUCE. 
FIGURE 13.2 


Examples of deleterious macromutations produced by experiments on fruit flies, 
including the “short wings,” “curly wings,” “eyeless,’ and Antennapedia 
mutants. 


This problem has led to what Georgia Tech geneticist John F. McDonald has 
called a “great Darwinian paradox.”!© He notes that the genes that are obviously 
variable within natural populations seem to affect only minor aspects of form 
and function—while those genes that govern major changes, the very stuff of 
macroevolution, apparently do not vary or vary only to the detriment of the 
organism. As he puts it, “Those [genetic] loci that are obviously variable within 
natural populations do not seem to lie at the basis of many major adaptive 
changes, while those loci that seemingly do constitute the foundation of many if 
not most major adaptive changes are not variable within natural populations.”!7 
In other words, the kind of mutations the evolutionary process would need to 
produce new animal body plans—namely, beneficial regulatory changes 
expressed early in development—don’t occur. Whereas, the kind that it doesn’t 
need—viable genetic mutations in DNA generally expressed late in development 
—do occur. Or put more succinctly, the kind of mutations we need for major 
evolutionary change we don’t get; the kind we get we don’t need. 

My Discovery Institute colleague Paul Nelson (see Fig. 13.3), a philosopher 
of biology who specializes in evolutionary theory and developmental biology, 


summarizes the challenge to neo-Darwinism posed by animal development as 
three premises: 


1. Animal body plans are built in each generation by a stepwise process, from 
the fertilized egg to the many cells of the adult. The earliest stages in this 
process determine what follows. 

2. Thus, to evolve any body plan, mutations expressed early in development 
must occur, must be viable, and must be stably transmitted to offspring. 

3. Such early-acting mutations of global effect on animal development, 
however, are those least likely to be tolerated by the embryo and, in fact, 
never have been tolerated in any animals that developmental biologists have 
studied. 





FIGURE 13.3 
Paul Nelson. Courtesy Paul Nelson. 


Nelson came to appreciate the depth of the problem posed by these facts after 
many years of discussion with two members of his University of Chicago Ph.D. 
committee, evolutionary biologist Leigh Van Valen (1935-2010) and 
evolutionary theorist and philosopher of biology William Wimsatt. Van Valen, 
famous for his “Red Queen hypothesis” about the need for organisms to continue 
to evolve in order to maintain fitness, was passionately interested in the 


mechanisms of macroevolution. Wimsatt originated the theory of “generative 
entrenchment,” an account of the “causal asymmetries” at work in complex 
systems, including those responsible for animal development.'® Both 
acknowledged to Nelson that the scientific literature offers no examples of viable 
mutations affecting early animal development and body-plan formation (Premise 
3, on previous page) and also that the macroevolution of novel animal form 
requires just such early-acting mutations (Premise 2, on previous page). 
Nevertheless, both Van Valen and Wimsatt remained committed to the descent of 
animal forms from a common ancestor via some kind of undirected mutations. 
Nelson argues, however, that those premises strongly imply that the neo- 
Darwinian mechanism does not—and indeed cannot—provide an adequate 
mechanism for producing new animal body plans. As he has told me: “If the 
only kind of mutations that can conceivably produce enough morphological 
change to alter whole body plans never causes beneficial and heritable changes, 
then it is difficult to see how mutation and selection could ever produce new 
body plans in the first place.” 
Thus, he concludes: 


Research on animal development and macroevolution over the last thirty 
years—research done from within the neo-Darwinian framework—has 
shown that the neo-Darwinian explanation for the origin of new body 
plans is overwhelmingly likely to be false—and for reasons that Darwin 
himself would have understood. 


Indeed, Darwin himself insisted that “nothing can be effected” by natural 
selection, “unless favorable variations occur.”*? Or as Swedish evolutionary 
biologist Sgren Lgvtrup succinctly explains: “Without variation, no selection; 
without selection, no evolution. This assertion is based on logic of the simplest 
kind. . . . Selection pressure as an evolutionary agent becomes void of sense 
unless the availability of the proper mutations is assumed.”*! Yet the “proper” 
kind of mutations—the mutations that produce favorable changes to early- 
acting, body-plan—shaping, regulatory genes—do not occur. 

Microevolutionary change is_ insufficient; macromutations—large-scale 
changes—are harmful. This paradox has beset Darwinism from its inception, but 
discoveries about the genetic regulation of development in animals have made 
this paradox more acute and cast serious doubt on the efficacy of the modern 
neo-Darwinian mechanism as an explanation for the new body plans that arise in 


the Cambrian period. 


DEVELOPMENTAL GENE REGULATORY NETWORKS 


Another line of research in developmental biology has revealed a related 
challenge to the creative power of the neo-Darwinian mechanism. 
Developmental biologists have discovered that many gene products (proteins 
and RNAs) needed for the development of specific animal body plans transmit 
signals that influence the way individual cells develop and differentiate 
themselves. Additionally, these signals affect how cells are organized and 
interact with each other during embryological development. These signaling 
molecules influence each other to form circuits or networks of coordinated 
interaction, much like integrated circuits on a circuitboard. For example, exactly 
when a signaling molecule gets transmitted often depends upon when a signal 
from another molecule is received, which in turn affects the transmission of still 
others—all of which are coordinated and integrated to perform specific time- 
critical functions. The coordination and integration of these signaling molecules 
in cells ensures the proper differentiation and organization of distinct cell types 
during the development of an animal body plan. Consequently, just as mutating 
an individual regulatory gene early in the development of an animal will 
inevitably shut down development, so too will mutations or alterations in the 
whole network of interacting signaling molecules destroy a developing embryo. 

No biologist has explored the regulatory logic of animal development more 
deeply than Eric Davidson, at the California Institute of Technology. Early in his 
career, collaborating with molecular biologist Roy Britten, Davidson formulated 
a theory of “gene regulation for higher cells.”** By “higher cells” Davidson and 
Britten meant the differentiated, or specialized, cells found in any animal after 
the earliest stages of embryological development. Davidson observed that the 
cells of an individual animal, no matter how varied in form or function, 
“generally contain identical genomes.”*? During the life cycle of an organism, 
the genomes of these specialized cells express only a small fraction of their DNA 
at any given time and produce different RNAs as a result. These facts strongly 
suggest that some animal-wide system of genetic control functions to tum 
specific genes on and off as needed throughout the life of the organism—and 
that such a system functions during the development of an animal from egg to 
adult as different cell types are being constructed. 

When they proposed their theory in 1969, Britten and Davidson 
acknowledged that “little is known . . . of the molecular mechanisms by which 


gene expression is controlled in differentiated cells.”*+ Nevertheless, they 
deduced that such a system must be at work. Given: (1) that tens or hundreds of 
specialized cell types arise during the development of animals, and (2) that each 
cell contains the same genome, they reasoned (3) that some control system must 
determine which genes are expressed in different cells at different times to 
ensure the differentiation of different cell types from each other—some system- 
wide regulatory logic must oversee and coordinate the expression of the 
genome.*? 

Davidson has dedicated his career to discovering and describing the 
mechanisms by which these systems of gene regulation and control work during 
embryological development. During the last two decades, research in genomics 
has revealed that nonprotein-coding regions of the genome control and regulate 
the timing of the expression of the protein-coding regions of the genome. 
Davidson has shown that the nonprotein-coding regions of DNA that regulate 
and control gene expression and the protein-coding regions of the genome 
together function as circuits. These circuits, which Davidson calls 
“developmental gene regulatory networks” (or dGRNs) control the 
embryological development of animals. 
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FIGURE 13.4 


Developmental gene regulatory networks (dGRNs) and development in the 
purple sea urchin, Strongylocentrotus purpuratus. Figure 13.4a (top, left): shows 
the actual embryo, starting at 6 hours and progressing through cell division to 55 
hours, when the larval skeleton appears. Figure 13.4b (bottom, left): depicts the 
major classes of genes involved in specifying the larval skeleton. Figure 13.4c 
(top, right): shows the detailed genetic circuitry implicated in the overall “gene 
regulatory network” (“GRN”) that controls the construction of the larval 
skeleton. Courtesy National Academy of Sciences, U.S.A. 


On arriving at Caltech in 1971, Davidson chose the purple sea urchin, 
Strongylocentrotus purpuratus, as his experimental model system. The biology 
of S. purpuratus makes it an attractive laboratory subject: the species occurs 
abundantly along the Pacific coast, produces enormous quantities of easily 
fertilized eggs in the lab, and lives for many years.*° Davidson and _ his 


coworkers pioneered the technology and experimental protocols required to 
dissect the sea urchin’s genetic regulatory system. 

The remarkable complexity of what they found needs to be depicted visually. 
Figure 13.4a shows the urchin embryo as it appears six hours after development 
has begun (top left of diagram). This is the 16-cell stage, meaning that four 
rounds of cell division have already occurred (1 + 2 + 4 = 8 = 16). As 
development proceeds in the next four stages, both the number of cells and the 
degree of cellular specialization increases, until, at 55 hours, elements of the 
urchin skeleton come into focus. Figure 13.4b shows, corresponding to these 
drawings of embryo development, a schematic diagram with the major classes of 
genes (for cell and tissue types) represented as boxes, linked by control arrows. 
Last, Figure 13.4c shows what Davidson calls “the genetic circuitry” that turns 
on the specific biomineralization genes that produce the structural proteins 
needed to build the urchin skeleton.” 

This last diagram represents a developmental gene regulatory network (or 
dGRN), an integrated network of protein and RNA-signaling molecules 
responsible for the differentiation and arrangement of the specialized cells that 
establish the rigid skeleton of the sea urchin. Notice that, to express the 
biomineralization genes that produce structural proteins that make the skeleton, 
genes far upstream, activated many hours earlier in development, must first play 
their role. 

This process does not happen fortuitously in the sea urchin but via highly 
regulated and precise control systems, as it does in all animals. Indeed, even one 
of the simplest animals, the worm C. elegans, possessing just over 1,000 cells as 
an adult, is constructed during development by dGRNs of remarkable precision 
and complexity. In all animals, the various dGRNs direct what Davidson 
describes as the embryo’s “progressive increase in complexity”—an increase, he 
writes, that can be measured in “informational terms.”2° 

Davidson notes that, once established, the complexity of the dGRNs as 
integrated circuits makes them stubbornly resistant to mutational change—a 
point he has stressed in nearly every publication on the topic over the past fifteen 
years. “In the sea urchin embryo,” he points out, “disarming any one of these 
subcircuits produces some abnormality in expression.”? 

Developmental gene regulatory networks resist mutational change because 
they are organized hierarchically. This means that some developmental gene 
regulatory networks control other gene regulatory networks, while some 
influence only the individual genes and proteins under their control. At the 





center of this regulatory hierarchy are the regulatory networks that specify the 
axis and global form of the animal body plan during development. These dGRNs 
cannot vary without causing catastrophic effects to the organism. 

Indeed, there are no examples of these deeply entrenched, functionally 
critical circuits varying at all. At the periphery of the hierarchy are gene 
regulatory networks that specify the arrangements for smaller-scale features that 
can sometimes vary. Yet, to produce a new body plan requires altering the axis 
and global form of the animal. This requires mutating the very circuits that do 
not vary without catastrophic effects. As Davidson emphasizes, mutations 
affecting the dGRNs that regulate body-plan development lead to “catastrophic 
loss of the body part or loss of viability altogether.”°° He explains in more detail: 


There is always an observable consequence if a dGRN subcircuit is 
interrupted. Since these consequences are always catastrophically bad, 
flexibility is minimal, and since the subcircuits are all interconnected, the 
whole network partakes of the quality that there is only one way for things 
to work. And indeed the embryos of each species develop in only one 
way.?! 


ENGINEERING CONSTRAINTS 


Davidson’s findings present a profound challenge to the adequacy of the neo- 
Darwinian mechanism. Building a new animal body plan requires not just new 
genes and proteins, but new dGRNs. But to build a new dGRN from a 
preexisting dGRN by mutation and selection necessarily requires altering the 
preexisting developmental gene regulatory network (the very kind of change 
that, as we saw in Chapter 12, cannot arise without multiple coordinated 
mutations). In any case, Davidson’s work has also shown that such alterations 
inevitably have catastrophic consequences. 

Davidson’s work highlights a profound contradiction between the neo- 
Darwinian account of how new animal body plans are built and one of the most 
basic principles of engineering—the principle of constraints. Engineers have 
long understood that the more functionally integrated a system is, the more 
difficult it is to change any part of it without damaging or destroying the system 
as a whole. Davidson’s work confirms that this principle applies to developing 
organisms in spades. The system of gene regulation that controls animal-body- 
plan development is exquisitely integrated, so that significant alterations in these 
gene regulatory networks inevitably damage or destroy the developing animal.** 
But given this, how could a new animal body plan, and the new dGRNs 
necessary to produce it, ever evolve gradually via mutation and selection from a 
preexisting body plan and set of dGRNs? 

Davidson makes clear that no one really knows: “contrary to classical 
evolution theory, the processes that drive the small changes observed as species 
diverge cannot be taken as models for the evolution of the body plans of 
animals.”°? He elaborates: 


Neo-Darwinian evolution . .. assumes that all process works the same 
way, so that evolution of enzymes or flower colors can be used as current 
proxies for study of evolution of the body plan. It erroneously assumes that 
change in protein-coding sequence is the basic cause of change in 
developmental program; and it erroneously assumes that evolutionary 
change in body-plan morphology occurs by a continuous process. All of 
these assumptions are basically counterfactual. This cannot be surprising, 
since the neo-Darwinian synthesis from which these ideas stem was a 
premolecular biology concoction focused on population genetics and .. . 


natural history, neither of which have any direct mechanistic import for 
the genomic regulatory systems that drive embryonic development of the 
body plan.*4 


NOW AND THEN 


Eric Davidson’s work, like that of Niisslein-Volhard and Wieschaus, highlights a 
difficulty of obvious relevance to the Cambrian explosion. ‘Typically, 
paleontologists understand the Cambrian explosion as the geologically sudden 
appearance of new forms of animal life. Building these forms requires new 
developmental programs— including both new early-acting regulatory genes and 
new developmental gene regulatory networks. Yet if neither early-acting 
regulatory genes nor dGRNs can be altered by mutation without destroying 
existing developmental programs (and thus animal form), then mutating these 
entities will leave natural selection with nothing favorable to select and the 
evolution of animal form will, at that point, terminate. 

Darwin’s doubt about the Cambrian explosion centered on the problem of 
missing fossil intermediates. Not only have those forms not been found, but the 
Cambrian explosion itself illustrates a profound engineering problem that fossil 
evidence does not address—the problem of building a new form of animal life 
by gradually transforming one tightly integrated system of genetic components 
and their products into another. Yet, in the next chapter, we will see that an even 
more formidable problem remains. 
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The Epigenetic Revolution 


In 1924, two German scientists, Hans Spemann and Hilda Mangold, reported an 
intriguing experiment, the significance of which could not have been fully 
appreciated at the time, three decades before the discovery of the information- 
bearing properties of DNA. Using microsurgery, Spemann and Mangold excised 
a portion of a newt embryo and transplanted that portion into another developing 
newt embryo.! 

They achieved a startling result. The second embryo produced two bodies, 
each with a head and tail, joined together at the belly, not unlike Siamese twins. 
Yet despite dramatically altering the anatomy of the embryo, Spemann and 
Mangold did not alter its DNA. 

Their experiment later suggested a radical possibility: that something in 
addition to DNA profoundly influences the development of animal body plans. 
Other experiments suggested as much. In the 1930s and 1940s, American 
biologist Ethel Harvey showed experimentally that sea urchin embryos could 
undergo development up to about 500 cells after removal of their nuclei—in 
other words, without their nuclear DNA.* In the 1960s, Belgian scientists 
chemically blocked the transcription of DNA into RNA in amphibian embryos 
and found that the embryos could still develop to the point of containing several 
thousand cells.° In the 1970s, Canadian biologists showed that a frog embryo 
could undergo early development without its nucleus if the cell division 
apparatus from a sea urchin was injected into the egg.* 

None of these results indicate that embryos can develop fully without DNA. 
In every case, DNA was eventually necessary to complete embryological 
development. Yet these results suggest that DNA is not the whole story, that 
other sources of information are playing important roles in directing at least the 
early stages of animal development. 


ABOVE AND BEYOND: EPIGENETIC INFORMATION 


In 2003, MIT Press published a groundbreaking collection of scientific essays 
titled Origination of Organismal Form: Beyond the Gene in Developmental and 
Evolutionary Biology, edited by two distinguished developmental and 
evolutionary biologists, Gerd Miiller, of the University of Vienna, and Stuart 
Newman, of New York Medical College. In their volume, Miiller and Newman 
included a number of scientific articles describing recent discoveries in genetics 
and developmental biology—discoveries suggesting that genes alone do not 
determine the three-dimensional form and structure of an animal. Instead, many 
of the scientists in their volume reported that so-called epigenetic information— 
information stored in cell structures, but not in DNA sequences—plays a crucial 
role. The Greek prefix epi means “above” or “beyond,” so epigenetics refers to a 
source of information that lies beyond the genes. As Miiller and Newman 
explain in their introduction, “Detailed information at the level of the gene does 
not serve to explain form.”° Instead, as Newman explains, “epigenetic” or 
“contextual information” plays a crucial role in the formation of animal “body 
assemblies” during embryological development.°® 

Miller and Newman not only highlighted the importance of epigenetic 
information to the formation of body plans during development; they also argued 
that it must have played a similarly important role in the origin and evolution of 
animal body plans in the first place. They concluded that recent discoveries 
about the role of epigenetic information in animal development pose a 
formidable challenge to the standard neo-Darwinian account of the origin of 
these body plans—perhaps the most formidable of all. 

In the introductory essay to their volume, Miiller and Newman list a number 
of “open questions” in evolutionary biology, including the question of the origin 
of Cambrian-era animal body plans and the origin of organismal form generally, 
the latter being the central topic of their book. They note that though “the neo- 
Darwinian paradigm still represents the central explanatory framework of 
evolution,” it has “no theory of the generative.”’ In their view, neo-Darwinism 
“completely avoids [the question of] the origination of phenotypic traits and of 
organismal form.”® As they and others in their volume maintain, neo-Darwinism 
lacks an explanation for the origin of organismal form precisely because it 
cannot explain the origin of epigenetic information. 

I first learned about the problem of epigenetic information and the Spemann 
and Mangold experiment while driving to a private meeting of Darwin-doubting 


scientists on the central coast of California in 1993. In the car with me was 
Jonathan Wells (see Fig. 14.1), who was then finishing a Ph.D. in developmental 
biology at the University of California at Berkeley. Like some others in his field, 
Wells had come to reject the (exclusively) “gene-centric” view of animal 
development and to recognize the importance of nongenetic sources of 
information. 

By that time, I had studied many questions and challenges to standard 
evolutionary theories arising out of molecular biology. But epigenetics was new 
to me. On our drive, I asked Wells why developmental biology was so important 
to evolutionary theory and to assessing neo-Darwinism. I’ll never forget his 
reply. “Because” he said, “that’s where the whole theory is going to unravel.” 





FIGURE 14.1 
Jonathan Wells. Courtesy Laszlo Bencze. 


In the years since, Wells has developed a powerful argument against the 
adequacy of the neo-Darwinian mechanism as an explanation for the origin of 
animal body plans. His argument turns on the importance of epigenetic 
information to animal development. To see why epigenetic information poses an 
additional challenge to neo-Darwinism and what exactly biologists mean by 
“epigenetic” information, let’s examine the relationship between biological form 
and biological information. 


FORM AND INFORMATION 


Biologists typically define “form” as a distinctive shape and arrangement of 
body parts. Organismal forms exist in three spatial dimensions and arise in time 
—in the case of animals during development from embryo to adult. Animal form 
arises as material constituents are constrained to establish specific arrangements 
with an identifiable three-dimensional shape or “topography”’—one that we 
would recognize as the body plan of a particular type of animal. A particular 
“form,” therefore, represents a highly specific arrangement of material 
components among a much larger set of possible arrangements. 

Understanding form in this way suggests a connection to the notion of 
information in its most theoretically general sense. As I noted in Chapter 8, 
Shannon’s mathematical theory of information equated the amount of 
information transmitted with the amount of uncertainty reduced or eliminated in 
a series of symbols or characters. Information, in Shannon’s theory, is thus 
imparted as some options, or possible arrangements, are excluded and others are 
actualized. The greater the number of arrangements excluded, the greater the 
amount of information conveyed. Constraining a set of possible material 
arrangements, by whatever means, involves excluding some options and 
actualizing others. Such a process generates information in the most general 
sense of Shannon’s theory. It follows that the constraints that produce biological 
form also impart information, even if this information is not encoded in digital 
form. 

DNA contains not only Shannon information but also functional or specified 
information. The arrangements of nucleotides in DNA or of amino acids in a 
protein are highly improbable and thus contain large amounts of Shannon 
information. But the function of DNA and proteins depends upon extremely 
specific arrangements of bases and amino acids. 

Similarly, animal body plans represent, not only highly improbable, but also 
highly specific arrangements of matter. Organismal form and function depend 
upon the precise arrangement of various constituents as they arise during, or 
contribute to, embryological development. Thus, the specific arrangement of the 
other building blocks of biological form—cells, clusters of similar cell types, 
dGRNs, tissues, and organs—also represent a kind of specified or functional 
information. 

In Chapter 8, I noted that the ease with which Shannon’s information theory 


applies to molecular biology has sometimes led to confusion about the kind of 
information contained in DNA and proteins. It may have also created confusion 
about the places that specified information might reside in organisms. Perhaps 
because the information-carrying capacity of the gene can be so easily measured, 
biologists have often treated DNA, RNA, and proteins as the sole repositories of 
biological information. Neo-Darwinists have assumed that genes possess all the 
information necessary to specify the form of an animal. They have also assumed 
that mutations in genes will suffice to generate the new information necessary to 
build a new form of animal life.? Yet if biologists understand organismal form as 
resulting from constraints on the possible arrangements of matter at many levels 
in the biological hierarchy—from genes and proteins, to cell types and tissues, to 
organs and body plans—then biological organisms may well exhibit many levels 
of information-rich structure. Discoveries in developmental biology have 
confirmed this possibility. 


BEYOND GENES 


Many biologists no longer believe that DNA directs virtually everything 
happening within the cell. Developmental biologists, in particular, are now 
discovering more and more ways that crucial information for building body 
plans is imparted by the form and structure of embryonic cells, including 
information from both the unfertilized and fertilized egg. 

Biologists now refer to these sources of information as “epigenetic. 
Spemann and Mangold’s experiment is only one of many to suggest that 
something beyond DNA may be influencing the development of animal body 
plans. Since the 1980s, developmental and cell biologists such as Brian 
Goodwin, Wallace Arthur, Stuart Newman, Fred Nijhout, and Harold Franklin 
have discovered or analyzed many sources of epigenetic information. Even 
molecular biologists such as Sidney Brenner, who pioneered the idea that genetic 
programs direct animal development, now insist that the information needed to 
code for complex biological systems vastly outstrips the information in DNA.'! 

DNA helps direct protein synthesis. Parts of the DNA molecule also help to 
regulate the timing and expression of genetic information and the synthesis of 
various proteins within cells. Yet once proteins are synthesized, they must be 
arranged into higher-level systems of proteins and structures. Genes and proteins 
are made from simple building blocks—nucleotide bases and amino acids, 
respectively—arranged in specific ways. Similarly, distinctive cell types are 
made of, among other things, systems of specialized proteins. Organs are made 
of specialized arrangements of cell types and tissues. And body plans comprise 
specific arrangements of specialized organs. Yet the properties of individual 
proteins do not fully determine the organization of these higher-level structures 
and patterns.'* Other sources of information must help arrange individual 
proteins into systems of proteins, systems of proteins into distinctive cell types, 
cell types into tissues, and different tissues into organs. And different organs and 
tissues must be arranged to form body plans. 
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The hierarchical layering or arrangement of different sources of information. 
Note that the information necessary to build the lower-level electronic 
components does not determine the arrangement of those components on the 
circuit board or the arrangement of the circuit board and the other parts 
necessary to make a computer. That requires additional informational inputs. 


Two analogies may help clarify the point. At a construction site, builders will 
make use of many materials: lumber, wires, nails, drywall, piping, and windows. 
Yet building materials do not determine the floor plan of the house or the 
arrangement of houses in a neighborhood. Similarly, electronic circuits are 
composed of many components, such as resistors, capacitors, and transistors. But 
such lower-level components do not determine their own arrangement in an 
integrated circuit (see Fig. 14.2). 

In a similar way, DNA does not by itself direct how individual proteins are 
assembled into these larger systems or structures—cell types, tissues, organs, 
and body plans—during animal development.'° Instead, the three-dimensional 
structure or spatial architecture of embryonic cells plays important roles in 
determining body-plan formation during embryogenesis. Developmental 
biologists have identified several sources of epigenetic information in these 
cells. 


CYTOSKELETAL ARRAYS 


Eukaryotic cells have internal skeletons to give them shape and stability. These 
“cytoskeletons” are made of several different kinds of filaments including those 
called the “microtubules.” The structure and location of the microtubules in the 
cytoskeleton influence the patterning and development of embryos. Microtubule 
“arrays” within embryonic cells help to distribute essential proteins used during 
development to specific locations in these cells. Once delivered, these proteins 
perform functions critical to development, but they can only do so if they are 
delivered to their correct locations with the help of preexisting, precisely 
structured microtubule or cytoskeletal arrays (see Fig. 14.3). Thus, the precise 
arrangement of microtubules in the cytoskeleton constitutes a form of critical 
structural information. 

These microtubule arrays are made of proteins called tubulin, which are gene 
products. Nevertheless, like bricks that can be used to assemble many different 
structures, the tubulin proteins in the cell’s microtubules are identical to one 
another. Thus, neither the tubulin subunits, nor the genes that produce them, 
account for the differences in the shape of the microtubule arrays that distinguish 
different kinds of embryos and developmental pathways. Instead, the structure of 
the microtubule array itself is, once again, determined by the location and 
arrangement of its subunits, not the properties of the subunits themselves. 
Jonathan Wells explains it this way: “What matters in [embryological] 
development is the shape and location of microtubule arrays, and the shape and 
location of a microtubule array is not determined by its units.”! For this reason, 
as University of Colorado cell biologist Franklin Harold notes, it is impossible to 
predict the structure of the cytoskeleton of the cell from the characteristics of the 
protein constituents that form that structure.!° 





FIGURE 14.3 

Figure 14.3a (left) shows a still shot from an animation of microtubule (at the 
bottom of the image) made of tubulin proteins. Courtesy Joseph Condeelis. 
Figure 14.3b (right) shows a microscopic image of a large section of 
cytoskeleton made of many microtubules (and other elements) inside the cell in 
cross section. Courtesy The Company of Biologists and Journal of Cell Science. 


Another cell structure influences the arrangement of the microtubule arrays 
and thus the precise structures they form and the functions they perform. In an 
animal cell, that structure is called the centrosome (literally, “central body”), a 
microscopic organelle that sits next to the nucleus between cell divisions in an 
undividing cell. Emanating from the centrosome is the microtubule array that 
gives a cell its three-dimensional shape and provides internal tracks for the 
directed transport of organelles and essential molecules to and from the 
nucleus.!° During cell division the centrosome duplicates itself. The two 
centrosomes form the poles of the cell-division apparatus, and each daughter cell 
inherits one of the centrosomes; yet the centrosome contains no DNA.” Though 
centrosomes are made of proteins—gene products—the centrosome structure is 
not determined by genes alone. 


MEMBRANE PATTERNS 


Another important source of epigenetic information resides in the two- 
dimensional patterns of proteins in cell membranes.!® When messenger RNAs 
are transcribed, their protein products must be transported to the proper locations 
in embryonic cells in order to function properly. Directed transport involves the 
cytoskeleton, but it also depends on spatially localized targets in the membrane 
that are in place before transport occurs. Developmental biologists have shown 
that these membrane patterns play a crucial role in the embryological 
development of fruit flies. 


Membrane Targets 


For example, early embryo development in the fruit fly Drosophila 
melanogaster requires the regulatory molecules Bicoid and Nanos (among 
others). The former is required for anterior (head) development, and the latter is 
required for posterior (tail) development.!? In the early stages of embryological 
development, nurse cells pump Bicoid and Nanos RNAs into the egg. (Nurse 
cells provide the cell that will become the egg—known as the oocyte—and the 
embryo with maternally encoded messenger RNAs and proteins.) Cytoskeletal 
arrays then transport these RNAs through the oocyte, where they become 
attached to specified targets on the inner surface of the egg.*° Once in their 
proper place—but only then—Bicoid and Nanos play critical roles in organizing 
the head-to-tail axis of the developing fruit fly. They do this by forming two 
gradients (or differential concentrations), one with Bicoid protein most 
concentrated at the anterior end and another with Nanos protein most 
concentrated at the posterior end. 

Insofar as both of these molecules are RNAs—that is, gene products—genetic 
information plays an important role in this process. Even so, the information 
contained in the bicoid and nanos genes does not by itself ensure the proper 
function of the RNAs and proteins for which the genes code. Instead, preexisting 
membrane targets, already positioned on the inside surface of the egg cell, 
determine where these molecules will attach and how they will function. These 
membrane targets provide crucial information—spatial coordinates—for 
embryological development. 


Ion Channels and Electromagnetic Fields 


Membrane patterns can also provide epigenetic information by the precise 
arrangement of ion channels—openings in the cell wall through which charged 
electrical particles pass in both directions. For example, one type of channel uses 
a pump powered by the energy-rich molecule ATP to transport three sodium ions 
out of the cell for every two potassium ions that enter the cell. Since both ions 
have a charge of plus one (Nat, K+), the net difference sets up an 
electromagnetic field across the cell membrane.*! 

Experiments have shown that electromagnetic fields have “morphogenetic” 
effects—in other words, effects that influence the form of a developing 
organism. In particular, some experiments have shown that the targeted 
disturbance of these electric fields disrupts normal development in ways that 
suggest the fields are controlling morphogenesis.** Artificially applied electric 
fields can induce and guide cell migration. There is also evidence that direct 
current can affect gene expression, meaning internally generated electric fields 
can provide spatial coordinates that guide embryogenesis.*? Although the ion 
channels that generate the fields consist of proteins that may be encoded by 
DNA (just as microtubules consist of subunits encoded by DNA), their pattern in 
the membrane is not. Thus, in addition to the information in DNA that encodes 
morphogenetic proteins, the spatial arrangement and distribution of these ion 
channels influences the development of the animal. 


The Sugar Code 


Biologists know of an additional source of epigenetic information stored in the 
arrangement of sugar molecules on the exterior surface of the cell membrane. 
Sugars can be attached to the lipid molecules that make up the membrane itself 
(in which case they are called “glycolipids”), or they can be attached to the 
proteins embedded in the membrane (in which case they are called 
“glycoproteins”). Since simple sugars can be combined in many more ways than 
amino acids, which make up proteins, the resulting cell surface patterns can be 
enormously complex. As biologist Ronald Schnaar explains, “Each [sugar] 
building block can assume several different positions. It is as if an A could serve 
as four different letters, depending on whether it was standing upright, turned 
upside down, or laid on either of its sides. In fact, seven simple sugars can be 
rearranged to form hundreds of thousands of unique words, most of which have 
no more than five letters.”74 

These sequence-specific information-rich structures influence the 
arrangement of different cell types during embryological development. Thus, 
some cell biologists now refer to the arrangements of sugar molecules as the 
“sugar code” and compare these sequences to the digitally encoded information 
stored in DNA.*° As biochemist Hans-Joachim Gabius notes, sugars provide a 
system with “high-density coding” that is “essential to allow cells to 
communicate efficiently and swiftly through complex surface interactions.”7° 
According to Gabius, “These [sugar] molecules surpass amino acids and 
nucleotides by far in information-storing capacity.”?’ So the precisely arranged 
sugar molecules on the surface of cells clearly represent another source of 
information independent of that stored in DNA base sequences. 


NEO-DARWINISM AND THE CHALLENGE OF EPIGENETIC 
INFORMATION 


These different sources of epigenetic information in embryonic cells pose an 
enormous challenge to the sufficiency of the neo-Darwinian mechanism. 
According to neo-Darwinism, new information, form, and structure arise from 
natural selection acting on random mutations arising at a very low level within 
the biological hierarchy—within the genetic text. Yet both body-plan formation 
during embryological development and major morphological innovation during 


the history of life depend upon a specificity of arrangement at a much higher 
level of the organizational hierarchy, a level that DNA alone does not determine. 
If DNA isn’t wholly responsible for the way an embryo develops—for body-plan 
morphogenesis—then DNA sequences can mutate indefinitely and still not 
produce a new body plan, regardless of the amount of time and the number of 
mutational trials available to the evolutionary process. Genetic mutations are 
simply the wrong tool for the job at hand. 

Even in a best-case scenario—one that ignores the immense improbability of 
generating new genes by mutation and selection—mutations in DNA sequence 
would merely produce new genetic information. But building a new body plan 
requires more than just genetic information. It requires both genetic and 
epigenetic information—information by definition that is not stored in DNA and 
thus cannot be generated by mutations to the DNA. It follows that the 
mechanism of natural selection acting on random mutations in DNA cannot by 
itself generate novel body plans, such as those that first arose in the Cambrian 
explosion. 


GENE-CENTRIC RESPONSES 


Many of the biological structures that impart important three-dimensional spatial 
information—such as cytoskeletal arrays and membrane ion channels—are made 
of proteins. For this reason, some biologists have insisted that the genetic 
information in DNA that codes for these proteins does account for the spatial 
information in these various structures after all. In each case, however, this 
exclusively “gene-centric” view of the location of biological information—and 
the origin of biological form—has proven inadequate. 

First, in at least the case of the sugar molecules on the cell surface, gene 
products play no direct role. Genetic information produces proteins and RNA 
molecules, not sugars and carbohydrates. Of course, important glycoproteins and 
glycolipids (sugar-protein and sugar-fat composite molecules) are modified as 
the result of biosynthetic pathways involving networks of proteins. Nevertheless, 
the genetic information that generates the proteins in these pathways only 
determines the function and structure of the individual proteins; it does not 
specify the coordinated interaction between the proteins in the pathways that 
result in the modification of sugars.7® 

More important, the location of specific sugar molecules on the exterior 
surface of embryonic cells plays a critical role in the function that these sugar 


molecules play in intercellular communication and arrangement. Yet their 
location is not determined by the genes that code for the proteins to which these 
sugar molecules might be attached. Instead, research suggests that protein 
patterns in the cell membrane are transmitted directly from parent membrane to 
daughter membrane during cell division rather than as a result of gene 
expression in each new generation of cells.*? Since the sugar molecules on the 
exterior of the cell membrane are attached to proteins and lipids, it follows that 
their position and arrangement probably result from membrane-to-membrane 
transmission as well. 

Consider next the membrane targets that play a crucial role in embryological 
development by attracting morphogenetic molecules to specific places on the 
inner surface of the cell. These membrane targets consist largely of proteins, 
most of which are mainly specified by DNA. Even so, many “intrinsically 
disordered”*° proteins fold differently depending on the surrounding cellular 
context. This context thus provides epigenetic information. Further, many 
membrane targets include more than one protein, and these multiprotein 
structures do not automatically self-organize to form properly structured 
targets.2! Finally, it is not only the molecular structure of these membrane 
targets, but also their specific location and distribution that determines their 
function. Yet the location of these targets on the inner surface of the cell is not 
determined by the gene products out of which they are made any more than, for 
example, the locations of the bridges across the River Seine in Paris are 
determined by the properties of the stones out of which they are made. 

Similarly, the sodium-potassium ion pumps in cell membranes are indeed 
made of proteins. Nevertheless, it is, again, the location and distribution of those 
channels and pumps in the cell membrane that establish the contours of the 
electromagnetic field that, in turn, influence embryological development. The 
protein constituents of these channels do not determine where the ion channels 
are located. 

Like membrane targets and ion channels, microtubules are also made of many 
protein subunits, themselves undeniably the products of genetic information. In 
the case of microtubule arrays, defenders of the gene-centric view do not claim 
that individual tubulin proteins determine the structure of these arrays. 
Nevertheless, some have suggested that other proteins, or suites of proteins, 
acting in concert could determine such higher-level form. For example, some 
biologists have noted that so-called helper proteins—which are gene products— 
called “microtubule associated proteins” (MAPs) help to assemble the tubulin 


subunits in the microtubule arrays. 

Yet MAPs, and indeed many other necessary proteins, are only part of the 
story. The locations of specified target sites on the interior of the cell membrane 
also help to determine the shape of the cytoskeleton. And, as noted, the gene 
products out of which these targets are made do not determine the location of 
these targets. Similarly, the position and structure of the centrosome—the 
microtubule-organizing center—also influences the structure of the cytoskeleton. 
Although centrosomes are made of proteins, the proteins that form these 
structures do not entirely determine their location and form. As Mark McNiven, 
a molecular biologist at the Mayo Clinic, and cell biologist Keith Porter, 
formerly of the University of Colorado, have shown, centrosome structure and 
membrane patterns as a whole convey three-dimensional structural information 
that helps determine the structure of the cytoskeleton and the location of its 
subunits.°* Moreover, as several other biologists have shown, the centrioles that 
compose the centrosomes replicate independently of DNA replication: daughter 
centrioles receive their form from the overall structure of the mother centriole, 
not from the individual gene products that constitute them.*° 

Additional evidence of this kind comes from ciliates, large single-celled 
eukaryotic organisms. Biologists have shown that microsurgery on the cell 
membranes of ciliates can produce heritable changes in membrane patterns 
without altering the DNA.°** This suggests that membrane patterns (as opposed 
to membrane constituents) are impressed directly on daughter cells. In both cases 
—in membrane patterns and centrosomes—form is transmitted from parent 
three-dimensional structures to daughter three-dimensional structures directly. It 
is not entirely contained in DNA sequences or the proteins for which these 
sequences code.*° 

Instead, in each new generation, the form and structure of the cell arises as 
the result of both gene products and the preexisting three-dimensional structure 
and organization inherent in cells, cell membranes, and cytoskeletons. Many 
cellular structures are built from proteins, but proteins find their way to correct 
locations in part because of preexisting three-dimensional patterns and 
organization inherent in cellular structures. Neither structural proteins nor the 
genes that code for them can alone determine the three-dimensional shape and 
structure of the entities they build. Gene products provide necessary, but not 
sufficient, conditions for the development of three-dimensional structure within 
cells, organs, and body plans.°° If this is so, then natural selection acting on 
genetic variation and mutations alone cannot produce the new forms that arise in 


the history of life. 


EPIGENETIC MUTATIONS 


When I explain this in public talks, I can count on getting the same question. 
Someone in the audience will ask whether mutations could alter the structures in 
which epigenetic information resides. The questioner wonders if changes in 
epigenetic information could supply the variation and innovation that natural 
selection needs to generate new form, in much the same way that neo-Darwinists 
envision genetic mutations doing so. It’s a reasonable thing to ask, but it turns 
out that mutating epigenetic information doesn’t offer a realistic way of 
generating new forms of life. 

First, the structures in which epigenetic information inheres—cytoskeletal 
arrays and membrane patterns, for example—are much larger than individual 
nucleotide bases or even stretches of DNA. For this reason, these structures are 
not vulnerable to alteration by many of the typical sources of mutation that affect 
genes such as radiation and chemical agent. 

Second, to the extent that cell structures can be altered, these alterations are 
overwhelmingly likely to have harmful or catastrophic consequences. The 
original Spemann and Mangold experiment did, of course, involve forcibly 
altering an important repository of epigenetic information in a developing 
embryo. Yet the resulting embryo, though interesting and illustrative of the 
importance of epigenetic information, did not stand a chance of surviving in the 
wild, let alone reproducing. 

Altering the cell structures in which epigenetic information inheres will likely 
result in embryo death or sterile offspring—for much the same reason that 
mutating regulatory genes or developmental gene regulatory networks also 
produces evolutionary dead ends. The epigenetic information provided by 
various cell structures is critical to body-plan development, and many aspects of 
embryological development depend upon the precise three-dimensional 
placement and location of these information-rich cell structures. For example, 
the specific function of morphogenetic proteins, the regulatory proteins produced 
by master regulatory (Hox) genes, and developmental gene regulatory networks 
(dGRNs) all depend upon the location of specific, information-rich, and 
preexisting cell structures. For this reason, altering these cell structures will in all 
likelihood damage something else crucial during the developmental trajectory of 
the organism. Too many different entities involved in development depend for 
their proper function upon epigenetic information for such changes to have a 


beneficial or even neutral effect. 

In Chapter 16 I will examine several new theories of evolution, including one 
known as “epigenetic inheritance.” We’ll see that there are some additional 
difficulties associated with the idea that mutations in epigenetic structures can 
produce significant evolutionary innovation. 


DARWIN’S GROWING ANOMALY 


With the publication of On the Origin of Species in 1859, Darwin advanced, first 
and foremost, an explanation for the origin of biological form. At the time, he 
acknowledged that the pattern of appearance of the Cambrian animals did not 
conform to his gradualist picture of the history of life. Thus, he regarded the 
Cambrian explosion as primarily a problem of incompleteness in the fossil 
record. 

In Chapters 2, 3, and 4, I explained why the problem of fossil discontinuity 
exemplified by the Cambrian forms has, since Darwin’s time, only intensified. 
Yet clearly a more fundamental problem now afflicts the whole edifice of 
modern neo-Darwinian theory. The neo-Darwinian mechanism does not account 
for either the origin of the genetic or the epigenetic information necessary to 
produce new forms of life. Consequently, the problems posed to the theory by 
the Cambrian explosion remain unsolved. But further, the central problem that 
Darwin set out to answer in 1859, namely the origin of animal form in general, 
remains unanswered—as Miiller and Newman in particular have noted.?” 

Contemporary critics of neo-Darwinism acknowledge, of course, that 
preexisting forms of life can diversify under the twin influences of natural 
selection and genetic mutation. Known microevolutionary processes can account 
for small changes in the coloring of peppered moths, the acquisition of antibiotic 
resistance in different strains of bacteria, and cyclical variations in the size of 
Galapagos finch beaks. Nevertheless, many biologists now argue that neo- 
Darwinian theory does not provide an adequate explanation for the origin of new 
body plans or events such as the Cambrian explosion. 

For example, evolutionary biologist Keith Stewart Thomson, formerly of Yale 
University, has expressed doubt that large-scale morphological changes could 
accumulate by minor changes at the genetic level.*? Geneticist George Miklos, 
of the Australian National University, has argued that neo-Darwinism fails to 
provide a mechanism that can produce large-scale innovations in form and 
structure.°? Biologists Scott Gilbert, John Opitz, and Rudolf Raff have attempted 


to develop a new theory of evolution to supplement classical neo-Darwinism, 
which, they argue, cannot adequately explain large-scale macroevolutionary 
change. As they note: 


Starting in the 1970s, many biologists began questioning its [neo- 
Darwinism’s] adequacy in explaining evolution. Genetics might be 
adequate for explaining microevolution, but microevolutionary changes in 
gene frequency were not seen as able to turn a reptile into a mammal or to 
convert a fish into an amphibian. Microevolution looks at adaptations that 
concern the survival of the fittest, not the arrival of the fittest. As Goodwin 
(1995) points out, “the origin of species—Darwin’s problem—remains 
unsolved. ”4? 


Gilbert and his colleagues have tried to solve the problem of the origin of 
form by invoking mutations in genes called Hox genes, which regulate the 
expression of other genes involved in animal development—an approach that I 
will examine in Chapter 16.4! Notwithstanding, many leading biologists and 
paleontologists—Gerry Webster and Brian Goodwin, Giinter Theissen, Marc 
Kirschner, and John Gerhart, Jeffrey Schwartz, Douglas Erwin, Eric Davidson, 
Eugene Koonin, Simon Conway Morris, Robert Carroll, Gunter Wagner, Heinz- 
Albert Becker and Wolf-Eckhart Lénnig, Stuart Newman and Gerd Miller, 
Stuart Kauffman, Peter Stadler, Heinz Saedler, James Valentine, Giuseppe 
Sermonti, James Shapiro and Michael Lynch, to name several—have raised 
questions about the adequacy of the standard neo-Darwinian mechanism, and/or 
the problem of evolutionary novelty in particular.4* For this reason, the 
Cambrian explosion now looks less like the minor anomaly that Darwin 
perceived it to be, and more like a profound enigma, one that exemplifies a 
fundamental and as yet unsolved problem—the origination of animal form. 


Part Three 


After Darwin, What? 
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The Post-Darwinian World and Self- 
Organization 


The year 2009 marked the 150th anniversary of the publication of The Origin of 
Species. In that year, the renowned Cambrian paleontologist Simon Conway 
Morris published an essay in the journal Current Biology titled “Walcott, the 
Burgess Shale and rumours of a post-Darwinian world,” assessing the current 
state of evolutionary biology. “Everywhere elsewhere in the Origin the 
arguments slide one by one skillfully into place, the towering edifice rises, and 
the creationists are left permanently in its shadow,” he wrote. “But not when it 
comes to the seemingly abrupt appearance of animal fossils.”! Instead, 
unresolved problems exposed by the Cambrian explosion have, in Conway 
Morris’s view, “opened the way to a post-Darwinian world.”* The evidence we 
reviewed in the previous sections of the book—evidence for a real, rather than 
merely apparent, explosion of animal form in the fossil record, and against the 
neo-Darwinian mechanism as an explanation for the origin of form and 
information—may help to explain why biology has begun to enter such a world. 
Moreover, any doubts that at least some biologists have begun to embrace a 
post-Darwinian perspective should have been laid to rest in the summer of 2008, 
when sixteen influential evolutionary biologists met for a private conference at 
the Konrad Lorenz Institute in Altenberg, Austria. The scientists, whom the 
science media later dubbed the “Altenberg 16,”° met to explore the future of 
evolutionary theory. These biologists had many different, and sometimes 
conflicting, ideas about how new forms of life might have evolved. But all were 
united by the conviction that the neo-Darwinian synthesis had run its course and 
that new evolutionary mechanisms were needed to explain the origin of 
biological form. As paleontologist Graham Budd, who was in attendance, 
explained, “When the public thinks about evolution, they think about [things 
like] the origin of wings. . . . But these are things that evolutionary theory has 


told us little about.”4 

Of course, explaining the origin of form is precisely what has made the 
Cambrian explosion so mysterious. In Chapter 7, in discussing the idea of 
punctuated equilibrium, I quoted Cambrian paleontologists James Valentine and 
Douglas Erwin, who concluded exactly that. They argued that neither punctuated 
equilibrium nor neo-Darwinism has accounted for the origin of new body plans 
and that, consequently, biology needs a new theory to explain “the evolution of 
novelty.”° 

The Altenberg 16 sought to address this challenge. Since the conference, and 
for nearly two decades preceding it, many evolutionary biologists have been 
working to formulate new theories of evolution, or at least new ideas about 
evolutionary mechanisms with more creative power than mutation and natural 
selection alone. Each of these new theories attempts to answer the increasingly 
urgent question: After Darwin—or neo-Darwinism—what ? 


THE NEO-DARWINIAN TRIAD THE NEO-DARWINIAN MECHANISM 
RESTS ON THREE CORE CLAIMS: FIRST, THAT EVOLUTIONARY 
CHANGE OCCURS AS THE RESULT OF RANDOM, MINUTE 
VARIATIONS (OR MUTATIONS); SECOND, THAT THE PROCESS OF 
NATURAL SELECTION SIFTS AMONG THOSE VARIATIONS AND 
MUTATIONS, SUCH THAT SOME ORGANISMS LEAVE MORE 
OFFSPRING THAN OTHERS (DIFFERENTIAL REPRODUCTION) BASED 
ON THE PRESENCE OR ABSENCE OF CERTAIN VARIATIONS; AND 
THIRD, FAVORED VARIATIONS MUST BE INHERITED FAITHFULLY IN 
SUBSEQUENT GENERATIONS OF ORGANISMS, THUS CAUSING THE 
POPULATION IN WHICH THEY RESIDE TO CHANGE OR EVOLVE OVER 
TIME.®° BIOLOGISTS MARC KIRSCHNER AND JOHN GERHART CALL 
THESE THREE ELEMENTS—VARIATION, NATURAL SELECTION, AND 
HERITABILITY—THE “THREE PILLARS” OF NEO-DARWINIAN 
EVOLUTION.’ 


Those evolutionary biologists who now doubt orthodox neo-Darwinian 
theory typically question or reject one or more of the elements of this neo- 
Darwinian triad (see Fig. 16.1). Eldredge and Gould questioned Darwinian 
gradualism, which led them to reject the idea that mutational change occurs in 
minute increments (i.e., the first element of the neo-Darwinian triad just 
mentioned). Other evolutionary biologists have since rejected other core 


elements of the neo-Darwinian mechanism and sought to replace them with 
other mechanisms or processes. This chapter will examine a new class of post- 
neo-Darwinian evolutionary models that attempt to explain the origin of 
biological form by deemphasizing the role of random mutations. These models 
instead emphasize the importance of “self-organizational” laws or processes to 
the evolution of biological form. 


SELF-ORGANIZATIONAL MODELS WELL BEFORE THE ALTENBERG 
16 CONVENED, A SIGNIFICANT NUMBER OF EVOLUTIONARY 
THEORISTS HAD ALREADY BEGUN TO EXPLORE ALTERNATIVES TO 
THE NEO-DARWINIAN SYNTHESIS. PUNCTUATED EQUILIBRIUM WAS 
ONE SUCH ALTERNATIVE. BUT AS SCIENTIFIC CRITICISMS OF THAT 
THEORY BEGAN TO MOUNT DURING THE 1980S AND 1990S, A GROUP 
OF SCIENTISTS ASSOCIATED WITH A THINK TANK IN NEW MEXICO, 
THE SANTA FE INSTITUTE, DEVELOPED A NEW THEORETICAL 
APPROACH. THEY CALLED IT “SELF-ORGANIZATION.” 


Whereas neo-Darwinism explains the origin of biological form and structure 
as the consequence of natural selection acting on random mutations, self- 
organizational theorists suggest that biological form often arises (or “self- 
organizes”) spontaneously as a consequence of the laws of nature (or “laws of 
form”). Natural selection, they theorize, acts to preserve this spontaneously 
arising order. They think spontaneous self-organizing order, not random genetic 
mutations, typically provides the ultimate source of new biological form. Thus, 
they deemphasize two of the three parts of the classical neo-Darwinian triad: 
random mutations and, to a lesser extent, natural selection. 

In 1993, the most prominent scientist associated with the Santa Fe Institute, 
former University of Pennsylvania biochemist Stuart Kauffman (see Fig. 15.1), 
released an eagerly awaited treatise, The Origins of Order: Self-Organization 
and Selection in Evolution.® Kauffman articulated a trenchant critique of the 
creative power of the mutation and selection mechanism, emphasizing some of 
the criticisms described here in previous chapters. Kauffman advanced a 
comprehensive alternative theory to account for the emergence of new form. In 
addition, he advanced a specific proposal for explaining the Cambrian 
explosion.? 





FIGURE 15.1 
Stuart Kauffman. Courtesy Wikimedia Commons, user Teemu Rajala. 


Kauffman notes that the development of animal body plans involves two 
phases: cell differentiation and body-plan morphogenesis (cell organization). He 
explores the possibility that self-organizational processes at work today— 
specifically in cell differentiation and body-plan formation—might help explain 
how new animal forms originated in the past. 

Kauffman proposes, first, that gene regulatory networks in animal cells— 
genes that regulate other genes—influence cell differentiation. They do this by 
generating predictable “pathways of differentiation,”!° patterns by which one 
type of cell will emerge from another over the course of embryological 
development as cells divide. For example, early in embryological development, 
one type of cell (call it cell type “A”) will divide and give rise to two other types 
of cells (call them types “B” and “C”), which will eventually generate cell types 
“D” and “E,” and “F and G,” respectively, and many other cell types as the 
process continues. Kauffman suggests that these pathways of differentiation 
“may reflect self-organizing features of complex genomic regulatory 
networks.”!! In other words, networks of regulatory genes in embryonic cells 
determine the pathways by which cells divide and differentiate. Since these 
patterns of cell differentiation may be determined by regulatory genes, Kauffman 
regards them as the inevitable by-products of self-organizational processes. 


Moreover, since “pathways of cell differentiation [have been] present in all 
multicellular organisms presumably since the Precambrian,”!* he suggests that 
self-ordering properties “inherent in a wide class of genomic regulatory 
networks”!? played a significant role in the origin of the animal forms. 

Kauffman makes a similar case for the importance of self-organizational 
processes during body-plan morphogenesis, the second phase of animal 
development. This phase involves not so much the differentiation of one cell 
type from another, but the arrangement and organization of different cell types 
into the distinct tissues and organs that jointly constitute various animal body 
plans. 

Kauffman again points to known processes of body-plan development and 
suggests that they could have played an important role in the formation of the 
first animal body plans. He cites the importance of the structural or 
“positional”!4 information in cells and cell membranes as the crucial 
determinants of how different cell types are organized into different animal 
forms. I discussed the importance of such “epigenetic” information to animal 
development in Chapter 14 and explained why it poses a problem for neo- 
Darwinian theory. By recognizing the importance of such information, Kauffman 
also rejects the neo-Darwinian assumption that a “genetic program” entirely 
determines animal development. He further regards the patterns of development 
that result from this positional information as evidence of self-ordering 
tendencies in matter and the existence of laws of biological form. 

Do these self-ordering tendencies or laws of form, if they exist, explain the 
origin of animal body plans and the information necessary to build them? They 
don’t. 


Self-Organization and Epigenetic Information To see why let’s look first at 
how Kauffman attempts to explain the epigenetic “positional” information that 
directs the organization of cells in the second phase of animal development. 
Kauffman attempts to explain this “positional” information by offering an 
entirely hypothetical and, ultimately, question-begging proposal. He invokes an 
idea sketched out in the 1940s by the famous English mathematician Alan 
Turing.'° Turing proposed that specific arrangements of cells in animal 
development might ultimately derive from the diffusion and specific 
arrangement of crucial molecules—presumably something like the morphogen 
proteins present in embryonic cells. (Recall that morphogens, or morphogen 


proteins, influence cell differentiation and organization during animal 
development.) Rather than attributing the distribution of these morphogenic 
proteins to preexisting genetic and epigenetic information in cells, as occurs 
during development in modern animals, Turing postulated that the distribution of 
these molecules might have originated in the first place independently of such 
information as the result of simple chemical reactions. He imagined one 
molecule producing both a copy of itself (“autocatalyzing”) and, in addition, 
producing a different molecule as well. Then he envisioned one of these 
molecules inhibiting the production of the other, thus allowing, through repeated 
cycles, the production of more and more of one molecule and less and less of the 
other. Turing thought the resulting nonuniformities in the patterns of distribution 
of those molecules would eventually result in nonuniform patterns in the 
distributions of different cells, possibly resulting in different animal forms. 


Kauffman expanded upon this proposal as a way of understanding how 
crucial positional information might organize as the result of chemical 
interactions of different molecules. Nevertheless, his proposal suffers from an 
obvious difficulty: it lacks any chemical or biological specificity. In explaining 
the proposal, Kauffman does not mention any specific chemicals or proteins that 
would behave in the way he envisions. Instead, he describes the behavior of 
hypothetical molecules that he labels with the indistinct monikers “X” and “Y.” 
More important, Kauffman offers no evidence that chemicals interacting in the 
way he envisions could create specific biologically relevant configurations or 
distributions of morphogen proteins—apart, that is, from the processes that 
generate specifically arranged distributions of these proteins in preexisting 
information-rich embryonic cells today. 

Instead, it is inherently implausible to think that the specificity necessary to 
coordinate the movements and arrangements of the billions or trillions of cells 
present in adult animal forms could be established by the interactions of one or 
two simple chemicals, even if they formed autocatalytic cycles. Kauffman 
himself seems tacitly to acknowledge the difficulty of generating biological 
specificity from the reactions of chemicals alone. He notes, in critique of his 
own model, that patterns of molecular diffusion produced by chemical 
autocatalysis would depend crucially upon “the initial conditions.”!® In other 
words, getting a biologically relevant information-rich arrangement of 
morphogenic proteins would require starting with a very specific (presumably 
information-rich) arrangement of autocatalyzing molecules. 


Kauffman encounters this same problem in attempting to explain the origin of 
the first life as the result of autocatalytic reactions starting from a prebiotic soup. 
In The Origins of Order, he acknowledges that generating an autocatalytic, or 
self-reproducing, set of molecules—a crucial step in his origin-of-life scenario— 
would require “high molecular specificity”’’ in the initial set of peptides or RNA 
molecules. In other words, it would require specificity of arrangement and 
structure, that is to say, functional information. 


Self-Organization and Genetic Information And what about the specifically 
genetic information necessary to the earlier phase of animal development? Does 
Kauffman’s self-organizational theory explain the origin of the “genetic 
regulatory networks” necessary to cell differentiation? Again, it does not. 
Instead, in an even more obvious way, it begs the question of the origin of these 
regulatory networks. Indeed, though Kauffman discusses cell differentiation as a 
kind of “self-ordering” or self-organizational process, he acknowledges that the 
predictable pathways of differentiation that characterize this process derive from 
preexisting gene regulatory networks. As Kauffman notes, the spontaneous 
ordering tendencies in cell differentiation are “inherent in a wide class of 
genomic regulatory networks.”!® Indeed, the genetic information in the gene 
regulatory networks does not come from self-ordering processes of cell 
differentiation. Instead, cell differentiation, to the extent that it can be properly 
described as “self-ordering,” results from preexisting genetic sources of 
information. Thus, the self-organizational process that Kauffman cites cannot 
explain the origin of genetic information, because it derives from it, as 
Kauffman’s own description reveals. 


In a later book, At Home in the Universe: The Search for the Laws of Self- 
Organization and Complexity, Kauffman does offer computer simulations of two 
“model systems”!’ that seek to explain, at least in principle, how genetic 
information might have self-organized. In one example, he describes a system of 
buttons connected by strings.*? The buttons represent novel genes or proteins 
and the strings represent self-organizational forces of attraction between the 
proteins. Kauffman suggests that when the complexity of this system reaches a 
critical threshold (as represented by the number of buttons and strings), new 
modes of organization might arise in the system “for free”*!—without intelligent 
guidance—similar to the way that water spontaneously changes to ice or vapor 
under specific conditions. 


Kauffman asks his readers to imagine a system of many interconnected lights. 
Each light can flash in a variety of states—on, off, twinkling, and so on. Since 
each light can adopt more than one possible state, the system may adopt a vast 
number of possible states. Further, in his system, rules determine how past states 
influence future states. Kauffman asserts that, as a result of these rules, the 
system would, if properly tuned, eventually produce a kind of order in which a 
few basic patterns of light activity recur with greater than random frequency. 
Since these patterns represent a small portion of the total number of possible 
states in which the system can reside, Kauffman suggests that self-organizational 
laws might similarly find highly improbable biological outcomes—perhaps even 
functional sequences of bases or amino acids within a much larger sequence 
space of possibilities.*7 

It’s not hard to see why these simulations also would fail to account for the 
origin of the new genes and proteins needed to produce the Cambrian animals. In 
both of his examples Kauffman presupposes significant sources of preexisting 
information. In his buttons and strings simulation, he intends the buttons to 
represent proteins, themselves the result of preexisting genetic information. 
Where did that information come from? Kauffman doesn’t say, but it is an 
essential part of what needs explanation in the history of life. Similarly, in his 
light system, the order that allegedly arises “for free”—that is, apart from an 
intelligent input of information—only does so if, as Kauffman acknowledges, 
the programmer “tunes” the system to keep it from either (a) generating an 
excessively rigid order or (b) devolving into chaos.*? This tuning presumably 
involves an intelligent programmer selecting certain parameters and excluding 
others—that is, inputting information. In fact, in summarizing the import of this 
illustration, Kauffman insists that it shows how the “orderliness of the cell, long 
attributed to the honing of Darwinian evolution, seems instead likely to arise 
from the dynamics of the genome network,”** that is, from preexisting— 
unexplained—sources of genetic information. 

In addition, Kauffman’s model systems are not analogous to biological 
systems because they are not constrained by functional considerations. A system 
of interconnected lights governed by preprogrammed rules may well settle into a 
small number of patterns within a much larger space of possibilities. But since 
these patterns have no function and need not meet any functional requirements, 
they have no specificity analogous to that in the genes of actual organisms. 
Kauffman’s model systems do not produce sequences or systems characterized 
by specified complexity or functional information. They produce modules of 


repetitive order distributed in an aperiodic manner, yielding mere complexity 
(i.e., information only in the Shannon sense).*° Getting a law-governed system 
to generate repetitive patterns of flashing lights, even with a certain amount of 
variation, is interesting, but not biologically relevant. A system of lights flashing 
“Vote for Jones,” on the other hand, would model a biologically relevant 
outcome, at least, if such a functional sequence of letters arose without 
intelligent agents programming the system with equivalent amounts of 
functionally specified information. 


Kauffman on the Cambrian Kauffman also proposes a specific self- 
organizational mechanism to explain some aspects of the Cambrian explosion. 
According to Kauffman, new Cambrian animals emerged through “long-jump” 

mutations that established new body plans in a discrete rather than gradual 
fashion.*° He recognizes that mutations affecting early development are almost 
inevitably harmful.*” Thus he concludes that body plans, once established, will 
not change, whatever subsequent evolution may occur. This keeps his proposal 
consistent with a top-down pattern in the fossil record in which higher taxa (and 
the body plans they represent) appear first, only later to be followed by the 
multiplication of lower taxa representing variations within those original body 
designs. 


Even so, Kauffman’s proposal begs the most important question: What 
produces the new Cambrian body plans in the first place? By invoking “long- 
jump mutations,” he identifies no specific self-organizational process that can 
produce such changes. Moreover, he concedes a principle that undermines his 
own proposal. As noted above, Kauffman acknowledges that mutations early in 
development are almost inevitably deleterious. Yet developmental biologists 
know that these are the only kind of mutations that have a realistic chance of 
producing large-scale evolutionary change—the big jumps that Kauffman 
invokes. Though Kauffman repudiates the neo-Darwinian reliance upon random 
mutations, he must invoke the most implausible kind of random mutation to 
provide a self-organizational account of the new Cambrian body plans. 


DEVELOPMENTAL TOOLKITS AND SELF-ORGANIZATIONAL 
PROCESSES MORE RECENTLY, ANOTHER ADVOCATE OF SELF- 
ORGANIZATION, STUART NEWMAN, A CELL BIOLOGIST AT THE NEW 
YORK MEDICAL COLLEGE, HAS PUBLISHED SEVERAL PAPERS 


SUGGESTING THAT SELF-ORGANIZATIONAL PROCESSES CAN HELP 
EXPLAIN THE ORIGIN OF BODY PLANS. IN A PAPER IN THE VOLUME 
PRODUCED FROM THE ALTENBERG 16 CONFERENCE, NEWMAN 
DEVELOPS A MODEL THAT RESEMBLES KAUFFMAN’S, BUT ONE 
THAT OFFERS MORE BIOLOGICAL SPECIFICITY.7® 


Newman, like Kauffman, invokes self-organizational processes. But Newman 
sees these processes acting dynamically and in coordination with a genetic 
“toolkit.” His model emphasizes the importance of a highly conserved (i.e., 
similar) set of regulatory genes in all the major Cambrian taxa. In his view, this 
common “developmental genetic toolkit”*? has been used “to generate animal 
body plans and organ forms for more than half a billion years”°° since the 
inception of the animal kingdom. 

But if all the animal taxa have the same toolkit, why are the various forms of 
animals and higher metazoan taxa so different from one another? For Newman, 
the answer to this question requires understanding how self-organizing processes 
influence the interaction of cells during development and how they cause genes 
to acquire different functions affecting the interactions of cells. 

For example, he attributes the emergence of multicellularity to cells acquiring 
the capacity “to remain attached to one another after dividing.”°! This capacity 
in tum derives not from generating new genes and proteins (as neo-Darwinism 
would assume). Instead, it derives from the repurposing of old genes and 
proteins in response to specific self-organizational (and epigenetic) processes 
such as the “physical force of adhesion.”°* Newman proposes, further, that once 
the first multicellular organisms had arisen, they would have “set the stage for 
additional physical processes to come into play”*’—processes that could alter 
the expression and function of still other genes in the developmental genetic 
toolkit, resulting in wholly new and different body plans. As Newman explains, 
“The phenomenon of multicellularity opened up possibilities for these molecules 
to become involved in the molding of bodies and organs.”** 

Newman envisions new animal body plans resulting from different cells 
sticking to each other in different configurations because of different forces of 
attraction between molecules on the surface of cells and because of different 
patterns of distribution of crucial molecules within cells. He calls these self- 
organizational forces and factors “dynamical patterning modules” (or DPMs).°° 
Figure 15.2 shows some typical ways that cells cluster together or arrange 
themselves as the result of these self-organizational forces. Newman lists many 


’ 


“dynamical patterning modules,” or self-organizational forces, responsible for 
the spontaneous emergence of these different cell clusters, including “adhesion, 
shape and surface polarization, switching between alternative biochemical states, 
biochemical oscillation, and the secretion of diffusible and nondiffusible 








factors.”2° 
DYNAMICAL PATTERNING MODULES (DPMs) {4 
DPM NAME  WHATIT DOES WHAT IT LOOKS LIKE 
Ait CAUSES ADHESTON AMONG A GROUP OF 
(ADHESTON) UNAGGREGATED CELLS, ALLOWING FoR 
MULTICELLULARITY. 
LAT TAKES A GROUP CF AGGREGATED CELLS AND 
(LATERAL ALLOWS CO-EXISTENCE OF ALTERNATIVE 
INHIBITION) CELL STATES WITHIN THE GROUP. 
DAD TAKES AN AGGREGATED GROUP OF DIFFERENT 
(DIFFERENTIAL TYPES OF CELLS AND ALLOWS SEPARATION OF 
ADHESION) CELLS INTO MULTILAYERED TISSUES. 
Pol, 
(POLARITY, TAKES A GROUP CF AGGREGATED CELLS AND 
APICAL-BASAL) CAUSES FORMATION OF INTERIOR CAVITIES. 
POL, TAKES An AGGREGATED GROUP OF CELLS 
(POLARITY, AND CAUSES ELONGATION OF TISSUES 
PLANAR) WITHIN A PLANE. 
Sc TAKES AN ELONGATED TISSUE AND CHEMICALLY 
(CHEMICAL INDUCES OSCILLATION OF CELL PATTERNS, 
OSCILLATION) ALLOWING SEGMENTATION OF A BODY PLAN. 
FIGURE 15.2 


Dynamical Patterning Modules (DPMs), showing the different ways that, 
according to biologist Stuart Newman, cells can stick to one another 
(“aggregate”) and form structures during animal development. 


To get a handle on what Newman has in mind, think of cells as Lego blocks. 
There are many different ways of connecting Lego blocks, depending on the 
shape of the blocks and the pattern of raised bumps and indentations on the 
blocks. These patterns allow small groups of Legos to be arranged into different 
modular structures: cubes, walls, circular rings, and so forth. Each of these 
smaller modular structures can then be combined to make many different larger 
structures, from airplanes and skyscrapers to submarines and castles. In a similar 
way, Newman suggests that different forces of adhesion between cells and 
different patterns of molecular diffusion within and between cells will generate 
many different patterns or motifs of multicellular organization, which in turn 


function as modular elements that can be combined in various ways to make 
diverse animal forms.” 

Do these self-organizational processes account for the origin of animal body 
plans in the Cambrian explosion or the information necessary to produce new 
animal forms? Again, they do not. Instead, Newman, like Kauffman, either fails 
to offer an adequate mechanism for generating crucial sources of biological 
information, or he begs the question by presupposing the existence of various 
sources of information. 


ASSUME A TOOLKIT IN THE FIRST PLACE, NEWMAN OBVIOUSLY 
PRESUPPOSES THE EXISTENCE OF A “DEVELOPMENTAL GENETIC 
TOOLKIT,” THAT IS, A WHOLE SET OF GENES, INCLUDING 
REGULATORY GENES, THAT HELP TO DIRECT THE DEVELOPMENT 
OF ANIMAL BODY PLANS. WHERE DOES THIS GENETIC 
INFORMATION COME FROM? HE DOESN’T SPECIFY, THOUGH 
PRESUMABLY HE MIGHT BE ASSUMING THE NEO-DARWINIAN 
MECHANISM SOMEHOW PRODUCED THE GENETIC INFORMATION IN 
THE TOOLKIT. IF SO, HE LEAVES HIS MODEL VULNERABLE TO THE 
CRITICISMS OUTLINED IN CHAPTERS 9 THROUGH 12. HE CERTAINLY 
DOES NOT CITE ANY SPECIFICALLY SELF-ORGANIZATIONAL 
PROCESS TO EXPLAIN THE ORIGIN OF THE GENETIC TOOLKIT. HE 
ALSO INCORRECTLY SEEMS TO PRESUPPOSE THAT THE GENES 
PRESENT IN THIS COMMON TOOLKIT PROVIDE ALL THE GENETIC 
INFORMATION NECESSARY TO SPECIFY INDIVIDUAL BODY PLANS. 
BUT THIS OVERLOOKS A HOST OF RECENT FINDINGS SHOWING 
THAT INDIVIDUAL SPECIES WITHIN SPECIFIC TAXA OFTEN REQUIRE 
GENES FOR DEVELOPMENT THAT ARE SPECIFIC TO THOSE SPECIES 
AND TAXA.*° THUS, THESE GENES WOULD NOT HAVE BEEN 
PRESENT IN A COMMON METAZOAN TOOLKIT OF THE KIND 
NEWMAN POSTULATES. 


Second, Newman does not account for the origin of the information necessary 
to organize modular arrangements or groups of cells into whole animal body 
plans. The forces at work in his dynamical patterning modules explain, at best, 
only the arrangements of small groups of cells, not the arrangements of those 
modular cell clusters into tissues, organs, and whole body plans. 

Think, again, of arranging Lego blocks. There are many ways of arranging 


small numbers of Lego blocks. These various arrangements form common 
structural motifs such as: two blocks stuck together at right angles; several 
curved blocks forming circular rings, stacked blocks forming hollow squares or 
walls or cube-like shapes; blocks arranged as prisms or cylinders; flat layers of 
blocks stacked two bumps thick or three bumps thick or more. Though these 
structural elements stick together because of interactions between the bumps and 
indentations on each block, those bumps and indentations themselves do not 
specify any particular larger structure—a castle or an airplane, for example— 
because each motif may be combined or recombined with many other structural 
motifs in numerous different ways. The shape and properties of the modular 
elements do not dictate the type of larger structure that must be built from them. 
Instead, to build a particular structure, the modular elements must be arranged in 
particular ways. And since there are many possible ways to arrange these 
modular elements, only one or a few of which will result in a desired structure, 
every Lego set includes a blueprint with step-by-step instructions—in other 
words, additional information. 

In a similar way, producing a body plan from the different types of cell 
clusters generated by Newman’s dynamical patterning modules (DPMs), would 
also require additional information. Newman does not account for this 
information. He correctly highlights the way certain recurrent motifs for 
organizing groups of cells seem to form spontaneously as the result of physical 
interactions between individual cells (his DPMs). He does not, however, 
establish that these groups of cells must arrange themselves into specific tissues, 
organs, or body plans in response to any known physical process or law. Instead, 
it seems entirely possible that these modular elements (cell clusters) have many 
“degrees of freedom” and can be arranged in innumerable ways. If so, then some 
additional information—an overall organismal blueprint or set of assembly 
instructions—would need to direct the arrangement of these modular elements. 
Newman does not consider this possibility. Nor does he cite any law-like self- 
organizational process that would eliminate the need for such information to 
direct animal development. 

There is yet a further problem with Newman’s proposal. Even the capacity for 
cells to self-organize into dynamical patterning modules probably derives from 
prior unexplained sources of information. Newman’s DPMs undoubtedly form as 
the result of interactions between molecules on the surface of cells, and as the 
result of chemical gradients between cells—with the specific configuration and 
properties of those molecules determining the exact structure of the individual 


DPMs. In that sense, the DPMs do, of course, self-organize, but clearly the 
specific ways that cells typically cluster together will depend upon highly 
specific and complex forces of interaction between the molecules and groups of 
molecules on the surface of these cells. Many of the molecules that contribute to 
these interactions are no doubt proteins, obvious products of genetic information. 
But, in addition, cell-to-cell interactions are affected by the arrangement of 
proteins and other molecules on the surface of cells (such as the sugars in the 
sugar code, see Chapter 14) as well as by the arrangement of structures made of 
proteins. But these molecular arrangements are, in turn, specified by either 
preexisting genetic or, more probably, epigenetic sources of information and 
structure. Thus, Newman’s analysis shows that the self-ordering tendencies (or 
biological laws of form), to the extent they exist, depend upon preexisting 
sources of biological information. Newman, again, does not explain where that 
information comes from. 

Newman does emphasize how epigenetic sources of information affect the 
expression and the function of gene products during the process of animal 
development. He notes that different gene products may perform different 
functions depending upon the organismal context in which they find themselves. 
But Newman does not explain with any specificity how the genes in the common 
toolkit acquire different functions in response to self-organizational processes or 
where the epigenetic information comes from that determines those functions. 
Yet, clearly context-dependent gene expression depends upon a host of other 
preexisting epigenetic sources of information and structure. 

Recall, for example, the discussion of cell membrane targets from Chapter 14. 
These targets provide an important source of epigenetic information by 
influencing the positioning of crucial morphogenic proteins. Yet, the 
arrangements of the targets on the cell membrane do not self-organize as the 
result of simple chemical interactions between the proteins out of which they are 
made*’—that is, the proteins do not determine the location of the membrane 
targets on the interior of the cell. Instead, the location and the structure of 
membrane targets are transmitted from parent membrane to daughter membrane, 
a process that transmits preexisting epigenetic structural information from the 
parent membrane. Newman does not attempt to explain the origin of this 
information or structure by any known self-organizational process. Instead, the 
evidence indicates that interior and exterior membrane targets, cytoskeletal 
arrays, the sugar code, and many other sources of epigenetic structural 
information do not self-organize as the result of physical interactions between 


their respective molecular subunits.*° 


ORDER VS. INFORMATION SELF-ORGANIZATIONAL THEORISTS 
FACE, IN ADDITION, A CONCEPTUAL DISTINCTION THAT HAS CAST 
DOUBT ON THE RELEVANCE OF THEIR THEORIES TO BIOLOGICAL 
SYSTEMS. SELF-ORGANIZATIONAL THEORISTS SEEK TO EXPLAIN 
THE ORIGIN OF “ORDER” IN LIVING SYSTEMS BY REFERENCE TO 
PURELY PHYSICAL OR CHEMICAL PROCESSES (OR LAWS 
DESCRIBING THOSE PROCESSES). BUT WHAT NEEDS TO BE 
EXPLAINED IN LIVING SYSTEMS IS NOT MAINLY ORDER IN THE 
SENSE OF SIMPLE REPETITIVE OR GEOMETRIC PATTERNS. INSTEAD, 
WHAT REQUIRES EXPLANATION IS THE ADAPTIVE COMPLEXITY 
AND THE INFORMATION, GENETIC AND EPIGENETIC, NECESSARY 
TO BUILD IT. 


Yet advocates of self-organization fail to offer examples of either biological 
information or complex anatomical structures arising from physics and 
chemistry alone. They either point, as Newman and Kauffman do, to 
embryological development unfolding predictably as the result of preexisting 
information-rich gene products, cell membranes, and other cell structures. Or 
they offer examples of purely physical and chemical processes generating a kind 
of order that has little relevance to the features of living systems that most need 
explanation. 

In the latter case, self-organizational theorists often point to simple geometric 
shapes or repetitive forms of order arising from or being modified by purely 
physical or chemical processes. They suggest that such order provides a model 
for understanding the origin of biological information or body-plan 
morphogenesis.*! Self-organizational theorists have cited crystals, vortices, and 
convection currents (or stable patterns of flashing lights) to illustrate the 
supposed power of physical processes to generate “order for free.” Crystals of 
salt do form as the result of forces of attraction between sodium and chloride 
ions; vortices can result from gravitational and other forces acting on water in a 
draining bathtub; convection currents do emerge from warm air (or molten rock) 
rising in enclosed spaces. And some molecules found in living systems do adopt 
highly ordered structures and recognizable geometric shapes as the result of the 
physical interactions of their constituent parts alone. Nevertheless, the type of 
order evident in these molecules or physical systems has nothing to do with the 


specific “order” of arrangement—the information or specified complexity—that 
characterizes the digital code in DNA and other higher-level information-rich 
biological structures. 

This is easiest to see in the case of the information encoded in DNA and 
RNA. Some of what follows may be familiar from my discussion in Chapter 8, 
but it bears repeating. The bases in the coding region of a section of DNA or in 
an RNA transcript are typically arranged in a nonrepetitive or aperiodic way. 
These sections of genetic text display what information scientists call 
“complexity,” not simple “order” or “redundancy.” 

To see the difference between order and complexity consider the difference 
between the following sequences: Na-Cl-Na-Cl-Na-Cl-Na-Cl 
AZFRT<MPGRTSHKLKYR 

The first sequence, describing the chemical structure of salt crystals, displays 
what information scientists call “redundancy” or simple “order.” That’s because 
the two constituents, Na and Cl (sodium and chloride), are highly ordered in the 
sense of being arranged in a simple, rigidly repetitive way. The sequence on the 
bottom, by contrast, exhibits complexity. In this randomly generated string of 
characters, there is no simple repetitive pattern. Whereas the sequence on the top 
could be generated by a simple rule or computer algorithm, such as “Every time 
Na arises, attach a Cl to it, and vice versa,” no rule shorter than the sequence 
itself could generate the second sequence. 

The information-rich sequences in DNA, RNA, and proteins, by contrast, are 
characterized not by either simple order or mere complexity, but instead by 
“specified complexity.” In such sequences, the irregular and unpredictable 
arrangement of the characters (or constituents) is critical to the function that the 
sequence performs. The three sequences below illustrate these distinctions: Na- 
Cl-Na-Cl-Na-Cl-Na-Cl (Order) AZFRT<MPGRTSHKLKYR (Complexity) Time 
and tide wait for no man (Specified complexity) What does all this have to do 
with self-organization? Simply this: the law-like, self-organizing processes that 
generate the kind of order present in a crystal or a vortex do not also generate 
complex sequences or structures; still less do they generate specified complexity, 
the kind of “order” present in a gene or functionally complex organ. 

Laws of nature by definition describe repetitive phenomena—order in that 
sense—that can be described with differential equations or universal “if-then” 
statements. Consider, for example, these informal expressions of the law of 
gravity: “All unsupported bodies fall” or “If an elevated body is left 
unsuspended, then it will fall.” These statements represent reasonably accurate 


law-like descriptions of natural gravitational phenomena precisely because we 
have repeated experience of unsupported bodies falling to the earth. In nature, 
repetition provides grist for lawful description. 

The information-bearing sequences in protein-coding DNA and RNA 
molecules do not exhibit such repetitive “order,” however. As such, these 
sequences can be neither described nor explained by reference to a natural law or 
law-like “self-organizational” process. The kind of nonrepetitive “order” on 
display in DNA and RNA—a precise sequential “order” necessary to ensure 
function—is not the kind that laws of nature or law-like self-organizational 
processes can—in principle—generate or explain. 

Otherwise, the nucleotide bases would repeat rigidly—such as 
ACACACACACACACACAC—in a way that would not allow DNA to store or 
convey specified information. A curious feature of the chemistry of DNA allows 
any one of the four nucleotide bases to attach to any site on the interior backbone 
of the DNA molecule. This chemical indeterminacy makes it possible for DNA 
and RNA to store any one of a virtually unlimited number of different 
arrangements of nucleotide bases—in effect, to encode any genetic message. But 
this indeterminacy also categorically defies explanation by deterministic law-like 
forces of chemical attraction. And because forces of attraction do not determine 
the sequence of nucleotide bases in DNA or RNA, the origin of the specific 
arrangement of the bases—the information in DNA and RNA—cannot be 
attributed to self-organizing forces of attraction either. 

Hubert Yockey, a leading innovator in the application of information theory 
to molecular biology, first recognized the problems associated with invoking 
self-organization to explain the origin of biological information. These theories 
fail, he argued, for two reasons. First, they do not distinguish order from 
information. And, second, the information in the DNA molecule does not derive 
from law-like forces of attraction.4* As he explained in 1977: “Attempts to relate 
the idea of order . . . with biological organization or specificity must be regarded 
as a play on words that cannot stand careful scrutiny. Informational 
macromolecules can code genetic messages and therefore can carry information 
because the sequence of bases or residues is affected very little, if at all, by [self- 
organizing] physicochemical factors.”*° 

Much the same thing is true of many vital sources of epigenetic information. 
The forces of attraction between constituent proteins in membrane targets or 
cytoskeletal arrays, for example, do not determine the structure or location of 
these epigenetic structures and the positional information they provide. The 


origin of these structures cannot be attributed to self-organizing forces of 
attraction either. Instead, in each case, information-rich epigenetic structures are 
generated from preexisting sources of epigenetic information. 

Thus, self-organizational theories explain well what does not mainly need 
explaining in biology, namely, repetitive or simple geometric forms of order. 
Self-organizational theorists do cite structures that might have self-organized. 
But these examples are typically extremely modest in scope. They include 
repeating patterns of atoms in crystals; simple geometric figures; patterns of 
lines, triangles, and streaks; vortices; spiral wave currents; and simple shapes 
that glide across computer screens.** None exhibit the specified complexity that 
characterizes the digital information in DNA and RNA or the complex 
arrangements of proteins, cells, tissues, and organs necessary to build a 
functioning form of animal life. 


NATURAL MAGIC OR TRUE CAUSE? 


In 2007, I participated in a private meeting of evolutionary biologists and other 
scientists who shared the conviction that a new theory of biological origins is 
now needed. In attendance were several prominent advocates of the self- 
organization approach. During the meeting, these scientists presented intriguing 
analogies from physics and chemistry to show how order might have arisen “for 
free”—that is, without intelligent guidance—in the biological realm. Yet the 
order they described in these analogies seemed to have no direct relevance to the 
complexity—indeed the specified complexity—of genes or cell membranes or 
animal body plans. Other scientists at the conference challenged the advocates of 
self-organization to cite known processes that could produce biologically 
relevant form and information. 

Near the end of the meeting one advocate of self-organization privately 
acknowledged to me the validity of these critiques, admitting that, for now, “self- 
organization is really more of a slogan than a theory.” Stuart Kauffman, perhaps 
attempting to make a virtue of the necessity of accepting this explanatory deficit, 
has recently celebrated the self-organizational perspective for embracing what he 
calls “natural magic.” In a lecture at MIT, he concluded: “Life bubbles forth in a 
natural magic beyond the confines of entailing law, beyond mathematization.”*° 
He went on to explain that one benefit of the self-organizational perspective is 
that it allows us to be “reenchanted” with nature and to “find a way beyond 
modernity.”“° 


Since the beginning of modem science, scientists have championed a 
commonsense principle of scientific reasoning known as the vera causa 
principle. This principle holds that explaining a particular phenomenon or event 
requires identifying a “true cause” that is known from experience to have the 
power to produce the event or phenomenon in question. The early modern 
scientists affirmed this principle as one of the key aspects of a scientific 
approach to understanding nature. This stood in opposition to the magical 
thinking that had gone before in which people attributed powers to nature that 
they had never observed it manifesting. As the scientific revolution matured, 
endeavors such as alchemy, for example, were eventually rejected precisely 
because the alchemists could not identify a cause that could effect the 
transformation they were seeking to demonstrate. 

Self-organizational theories have clearly failed to provide a vera causa for the 
origin of biologically relevant forms of “order”—the functional complexity and 
specified information present in living systems. Instead, they either beg the 
question as to the ultimate origin of biological information or point to physical 
and chemical processes that do not produce the specified complexity that 
characterizes actual animals. 

Viewed in this light, Kauffman’s recent discussion of natural magic and calls 
for a “reenchantment” with nature sound less like a bold new initiative to 
reconcile science and spirituality (which is what he intended) than a tacit 
admission that self-organizational theories have failed to identify known physical 
and chemical processes capable of generating the form and information present 
in actual living systems. Indeed, after years of attempting to solve the problem of 
the origin of form, Kauffman’s recent ruminations about “natural magic” sound a 
lot like an admission that a profound mystery remains. 
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Other Post-Neo-Darwinian Models 


When Stephen Jay Gould was first wrestling with the question of how new 
forms of animal life could have arisen so quickly in the fossil record, he 
considered many possible mechanisms of change. In the famed 1980 paper in 
which he declared neo-Darwinism “effectively dead,”! he didn’t just propose 
allopatric speciation and species selections as new evolutionary mechanisms. He 
also granted a rehearing to a long discredited idea. Specifically, he argued that 
large-scale “macromutations” might generate significant innovations in form 
relatively quickly. 

In the 1930s and 1940s, this idea had been associated with University of 
California at Berkeley geneticist Richard Goldschmidt. Aware of the many 
discontinuities in the fossil record, Goldschmidt envisioned radical 
transformations in the form of animals arising in even one generation as the 
result of such large-scale mutations. He endorsed, for instance, the view of the 
German paleontologist Otto Schindewolf (1896-1971) that “the first bird 
hatched from a reptilian egg” and, thus, in Goldschmidt’s words, “that the many 
missing links in the paleontological record are sought for in vain because they 
have never existed.”° If a bird hatched directly from a reptilian egg as the result 
of heritable, large-scale mutations, then such a sudden leap or “saltation” would 
obviously leave no fossil intermediates behind. 

Neo-Darwinists rejected this idea as biologically implausible in the extreme. 
They argued that changing so many functionally integrated anatomical and 
physiological systems so quickly would inevitably result in deformed mutants, 
not different integrated systems of organs constituting a whole new animal.* 
Goldschmidt’s macromutations, they contended, would produce not what 
Goldschmidt called “hopeful monsters,” but “hopeless monsters”—that is, 
nonviable organisms.” 

Though Gould wanted to reconsider a role for large-scale mutations, he 


carefully disassociated his proposal from Goldschmidt’s much ridiculed idea. 
Instead, he suggested that the mutations affecting genes in animal development 
might generate larger increments of morphological innovation than the mutations 
that affected other genes. These “developmental mutations,” he thought, might 
generate modular parts of biological systems in a short time—without needing to 
generate whole new forms of life in a single generation. He offered, as an 
example, the possibility that the gill arch bones of ancient jawless fish, though 
not the whole fish, might have arisen in one step as the result of a developmental 
macromutation. Gould explained: “I do not refer to the saltational origin of 
entire new designs, complete in all their complex and integrated features. . . . 
Instead, I envisage a potential saltational origin for the essential features of key 
adaptations.”° 

In response to heavy criticism from neo-Darwinists, Gould later downplayed 
the role of such larger-scale developmental mutations in the theory of punctuated 
equilibrium. Other evolutionary biologists, however, took his idea as an 
inspiration and developed theories that emphasize such developmental mutations 
as a driving force in macroevolution. Evolutionary theorists and developmental 
biologists such as Rudolf Raff, Sean B. Carroll, and Wallace Arthur have 
developed a subdiscipline of biology known as evolutionary developmental 
biology, or “evo-devo” for short. The evolutionary developmental biologists 
have since formulated alternative models that challenge a key aspect of the neo- 
Darwinian triad. Whereas neo-Darwinism envisions new form arising as the 
result of slow, incremental accumulations of minor mutations, evolutionary 
developmental biologists argue that mutations affecting genes involved in animal 
development can cause large-scale morphological change and even whole new 
body plans. 

This chapter will not only examine the ideas of evolutionary developmental 
biologists, but three of the other most prominent alternatives to neo-Darwinism, 
some proposed by members of the Altenberg 16 (see Fig. 16.1). Each of these 
alternatives emphasizes certain elements of the “triad” at the expense of others. 
Whereas the self-organizational alternatives that I discussed in the last chapter 
emphasize the role of law-like processes over random mutations, these other new 
theories reaffirm the importance of mutations, though each also reconceptualizes 
how mutations act. One approach falls under the rubric of “evo-devo” and 
conceives of mutations producing modifications in larger increments. Another, 
the neutral theory of evolution, sees mutations acting absent selection. Another, 
neo-Lamarckian “epigenetic inheritance,” envisions heritable alterations in 


epigenetic information influencing the future course of evolution. Still another, 
called “natural genetic engineering,” affirms that nonrandom genetic 
rearrangements drive evolutionary innovation.’ 
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The tenets of neo-Darwinian orthodoxy and the different ways that various non- 
Darwinian models of evolution deviate from those tenets. The boxes 
representing new evolutionary models are positioned under the headings of the 
neo-Darwinian tenets that they challenge. 


Let’s see if one of these proposals solves the twin problems of the origin of 
form and information and whether, therefore, it might also help to resolve the 
mystery of the Cambrian explosion.® 


EVO-DEVO AND ITS PROPOSALS THE NEO-DARWINIAN SYNTHESIS 
HAS LONG EMPHASIZED THAT LARGE-SCALE 
MACROEVOLUTIONARY CHANGE OCCURS AS THE INEVITABLE BY- 
PRODUCT OF THE ACCUMULATION OF SMALL-SCALE 
“MICROEVOLUTIONARY” CHANGES WITHIN POPULATIONS. THE 
CONSENSUS IN SUPPORT OF THIS IDEA BEGAN TO FRAY IN 
EVOLUTIONARY BIOLOGY DURING THE EARLY 1970S, WHEN YOUNG 


PALEONTOLOGISTS SUCH AS GOULD, NILES ELDREDGE, AND 
STEVEN STANLEY REALIZED THAT THE FOSSIL RECORD DID NOT 
SHOW A PATTERN OF GRADUAL “MICRO-TO-MACRO” CHANGE. IN 
1980, AT A NOW FAMOUS SYMPOSIUM ON MACROEVOLUTION AT 
THE FIELD MUSEUM IN CHICAGO, THE REBELLION BURST INTO 
FULL VIEW, EXPOSING WHAT DEVELOPMENTAL BIOLOGIST SCOTT 
GILBERT CALLED “AN UNDERGROUND CURRENT IN 
EVOLUTIONARY THEORY” AMONG THEORISTS WHO HAD 
CONCLUDED THAT “MACROEVOLUTION COULD NOT BE DERIVED 
FROM MICROEVOLUTION.”? 


At the conference, paleontologists who doubted the “micro-to-macro” 
consensus found allies among younger developmental biologists. They were 
dissatisfied with neo-Darwinism in part because they knew that population 
genetics, its mathematical expression, sought only to quantify changes in gene 
frequency rather than explain the origin of genes or novel body plans. Thus, 
many developmental biologists thought that neo-Darwinism did not offer a 
compelling theory of macroevolution.'° 

To formulate a more robust theory, many developmental biologists, such as 
Rudolf Raff, a developmental biologist at the University of Indiana and one of 
the founders of “evo-devo,” urged evolutionary theorists to incorporate insights 
from their discipline.’ For example, developmental biologists know that 
mutations expressed early in the development of animals are necessary to alter 
body-plan morphogenesis. Thus, they argue that these mutations must have 
played a significant role in generating whole new animal forms during the 
history of life. They assert that this understanding of developmental processes is 
crucial to understanding animal evolution. Some evo-devo advocates such as 
Sean B. Carroll and Jeffrey Schwartz have pointed specifically to homeotic (or 
Hox) genes—master regulatory genes that affect the location, timing, and 
expression of other genes—as entities capable of producing such large-scale 
change in animal form.'* These evo-devo advocates have broken with classical 
neo-Darwinism primarily in their understanding of the size or increment of 
mutational change. 


MAJOR BUT NOT VIABLE, VIABLE BUT NOT MAJOR DESPITE THE 
ENTHUSIASM SURROUNDING THE FIELD, EVO-DEVO FAILS, AND 
FOR AN OBVIOUS REASON: ITS MAIN PROPOSAL, THAT EARLY- 


ACTING DEVELOPMENTAL MUTATIONS CAN CAUSE STABLY 
HERITABLE, LARGE-SCALE CHANGES IN ANIMAL BODY PLANS, 
CONTRADICTS THE RESULTS OF ONE HUNDRED YEARS OF 
MUTAGENESIS EXPERIMENTS.!? AS WE SAW IN CHAPTER 13, THE 
EXPERIMENTS OF SCIENTISTS SUCH AS NUSSLEIN-VOLHARD AND 
WIESCHAUS HAVE SHOWN DEFINITIVELY THAT EARLY-ACTING 
BODY-PLAN MUTATIONS INVARIABLY GENERATE EMBRYONIC 
LETHALS—DEAD ANIMALS INCAPABLE OF FURTHER EVOLUTION. 
THE RESULTS OF THESE EXPERIMENTS HAVE GENERATED THE 
DILEMMA FOR EVOLUTIONARY BIOLOGISTS THAT GENETICIST 
JOHN MCDONALD APTLY DESCRIBED AS THE “GREAT DARWINIAN 
PARADOX.” RECALL THAT MCDONALD NOTED THAT EARLY-ACTING 
REGULATORY MUTATIONS DO NOT PRODUCE VIABLE ALTERATIONS 
IN FORM THAT WILL PERSIST IN POPULATIONS, AS EVOLUTION 
ABSOLUTELY REQUIRES. INSTEAD, THESE MUTATIONS ARE 
ELIMINATED IMMEDIATELY BY NATURAL SELECTION BECAUSE OF 
THEIR INVARIABLY DESTRUCTIVE CONSEQUENCES. ON THE OTHER 
HAND, LATER-ACTING MUTATIONS CAN GENERATE VIABLE 
CHANGES IN THE FEATURES OF ANIMALS, BUT THESE CHANGES DO 
NOT AFFECT GLOBAL ANIMAL ARCHITECTURES. THIS GENERATES A 
DILEMMA: MAJOR CHANGES ARE NOT VIABLE; VIABLE CHANGES 
ARE NOT MAJOR. IN NEITHER CASE DO THE KINDS OF MUTATION 
THAT ACTUALLY OCCUR PRODUCE VIABLE MAJOR CHANGES OF 
THE KIND NECESSARY TO BUILD NEW BODY PLANS. 


In 2007, I coauthored a textbook with several colleagues titled Explore 
Evolution. In it, we explained this “either/or” (“major-not-viable, viable-not- 
major”) dilemma and suggested that it posed a challenge to theories that rely on 
the mutation and selection mechanism to explain the origin of major 
morphological changes.'* The National Center for Science Education (NCSE)— 
an influential activist group that opposes allowing students to learn about 
scientific criticisms of evolutionary theory—challenged our critique. They 
charged that our textbook “fails to acknowledge the extensive research on 
mutations in DNA sequences that do not encode proteins, but which have 
important morphological effects.”!° In other words, they claimed that some 
viable mutations do produce major large-scale changes. 








FIGURE 16.2 

Superficial changes in insect wing coloration thought to be caused by mutations 
in cis-regulatory elements. Such examples show that mutations that affect 
development and that also result in viable offspring tend to be minor. Courtesy 
National Academy of Sciences, U.S.A. 


The NCSE cited papers from the “evo-devo” literature claiming that a type of 
mutation in the regulatory regions of the genome, “cis-regulatory” regions, have 
been shown to produce large-scale changes in winged insects. According to the 
NCSE, mutations in these cis-regulatory elements (or CREs) are “considered by 
many evolutionary biologists to have the greatest potential for generating 
evolutionary change.”!° What’s more, they insisted that “mutations in CREs play 
an important role in morphological evolution.”!” The NCSE cited a paper in the 
Proceedings of the National Academy of Sciences by three developmental 
biologists, Benjamin Prud’homme, Nicolas Gompel, and Sean B. Carroll.!® 

The paper did not show what the NCSE claimed, however. It did assert that 


changes in regulatory DNA produce “both relatively modest morphological 
differences among closely related species and more profound anatomical 
divergences among groups at higher taxonomical levels.”!9 But the study only 
showed how changes in the cis-regulatory elements in fruit fly DNA might have 
affected the coloration of wing spots in several different types of flying insects. 
It did not report any significant change in the form or body plan of these insects. 
Instead, the study highlighted a clear case of a viable mutation generating merely 
a minor or superficial change (see Fig. 16.2). 

Not surprisingly, many evolutionary biologists recognize that such regulatory 
mutations do not explain the evolution of new body plans. For example, Hopi 
Hoekstra, of Harvard University, and Jerry Coyne, two traditional neo- 
Darwinists, have published an article reviewing various evo-devo proposals in 
the journal Evolution. They note, “Genomic studies lend little support to the cis- 
regulatory theory” of evolutionary change. 

They also argue, tellingly, that most cis-regulatory mutations result in the loss 
of genetic and anatomical traits, including a famous case in which evolutionary 
biologists attributed the loss of pelvic spines in stickleback fish to mutations in 
cis-regulatory elements.*° Yet, as they argue, “supporting the evo-devo claim 
that cis-regulatory changes are responsible for morphological innovations 
requires showing that promoters are important in the evolution of new traits, not 
just the losses of old ones.” Hoekstra and Coyne conclude, “There is no evidence 
at present that cis-regulatory changes play a major role—much less a pre- 
eminent one—in adaptive evolution.”*! Given their commitment to neo- 
Darwinism, it’s fair to assume that Hoekstra and Coyne probably did not intend, 
in making this argument, to refute the NCSE’s criticism of our textbook Explore 
Evolution. Nevertheless, science, like politics, sometimes makes for strange 
bedfellows. 


WHAT ABOUT HOX GENES? 


When biology students hear my colleague Paul Nelson describe the “great 
Darwinian paradox” (see Chapter 13) in public lectures on university campuses, 
they often ask, “What about Hox genes?” Recall that Hox (or homeotic) genes 
regulate the expression of other protein-coding genes during the process of 
animal development. Some biologists have likened them to the conductor of an 
orchestra who plays the role of coordinating the contributions of the players. 
And because Hox genes affect so many other genes, many evo-devo advocates 


think that mutations in these genes can generate large-scale changes in form. 

For example, Jeffrey Schwartz, at the University of Pittsburgh, invokes 
mutations in Hox genes to explain the sudden appearance of animal forms in the 
fossil record. In his book Sudden Origins, Schwartz acknowledges the 
discontinuities in the fossil record. As he notes, “We are still in the dark about 
the origin of most major groups of organisms. They appear in the fossil record as 
Athena did from the head of Zeus—full-blown and raring to go, in contradiction 
to Darwin’s depiction of evolution as resulting from the gradual accumulation of 
countless infinitesimally minute variations.” 

What resolves this mystery? Schwartz, an evo-devo advocate, reveals his 
answer: “A mutation affecting the activity of a homeobox [Hox] gene can have a 
profound effect—such as turning . . . larval tunicates into the first chordates. 
Clearly, the potential homeobox genes have for enacting what we call 
evolutionary change would seem to be almost unfathomable.”*° 

But can mutations in Hox genes transform one form of animal life—one body 
plan—into another? There are several reasons to doubt that they can. 

First, precisely because Hox genes coordinate the expression of so many 
other different genes, experimentally generated mutations in Hox genes have 
proven harmful. William McGinnis and Michael Kuziora, two biologists who 
have studied the effects of mutations on Hox genes, have observed that in fruit 
flies “most mutations in homeotic [Hox] genes cause fatal birth defects.”2* In 
other cases, the resulting Hox mutant phenotype, while viable in the short term, 
is nonetheless markedly less fit than the wild type. For example, by mutating a 
Hox gene in a fruit fly, biologists have produced the dramatic Antennapedia 
mutant, a hapless fly with legs growing out of its head where the antennae 
should be (see Fig. 16.3).7° Other Hox mutations have produced fruit flies in 
which the balancers (tiny structures behind wings that stabilize the insect in 
flight, called “halteres”) are transformed into an extra pair of wings.*° Such 
mutations alter the structure of the animal, but not in a beneficial or permanently 
heritable way. The Antennapedia mutant cannot survive in the wild; it has 
difficulty reproducing, and its offspring die easily. Similarly, fruit-fly mutants 
sporting an extra set of wings lack the musculature to make use of them and, 
absent their balancers, cannot fly. As Hungarian evolutionary biologist Eérs 
Szathmary notes with cautious understatement in the journal Nature, 
“macromutations of this sort [i.e., in Hox genes] are probably frequently 
maladaptive.”*” 





FIGURE 16.3 

Photograph of an Antennapedia mutant with a pair of legs growing out of its 
head, where antennae would normally develop. Such examples show that 
mutations that occur early in animal development and that also produce major 
changes typically result in less fit offspring—in this case offspring that cannot 
reproduce. Courtesy Elsevier, Inc. 


Second, Hox genes in all animal forms are expressed after the beginning of 
animal development, and well after the body plan has begun to be established. In 
fruit flies, by the time that Hox genes are expressed, roughly 6,000 cells have 
already formed, and the basic geometry of the fly—its anterior, posterior, dorsal, 
and ventral axes—is already well established.*® So Hox genes don’t determine 
body-plan formation. Eric Davidson and Douglas Erwin have pointed out that 
Hox gene expression, although necessary for correct regional or local 
differentiation within a body plan, occurs much later during embryogenesis than 
global body-plan specification itself, which is regulated by entirely different 
genes. Thus, the primary origin of animal body plans in the Cambrian explosion 
is not merely a question of Hox gene action, but of the appearance of much 
deeper control elements—Davidson’s “developmental gene regulatory networks” 
(dGRNs).*° And yet, as we saw in Chapter 13, Davidson argues that it is 
extremely difficult to alter (GRNs without damaging their ability to regulate 
animal development. 

Third, Hox genes only provide information for building proteins that function 
as switches that turn other genes on and off. The genes that they regulate contain 
information for building proteins that form the parts of other structures and 
organs. The Hox genes themselves, however, do not contain information for 
building these structural parts. In other words, mutations in Hox genes do not 


have all the genetic information necessary to generate new tissues, organs, or 
body plans. 

Nevertheless, Schwartz argues that biologists can explain complex structures 
such as the eye just by invoking Hox mutations alone. He asserts that “[t]here are 
homeobox genes for eye formation and that when one of them, the Rx gene in 
particular, is activated in the right place and at the right time, an individual has 
an eye.”°° He also thinks that mutations in Hox genes help arrange organs to 
form body plans. 

In a review of Schwartz’s book, Eérs Szathmary finds Schwartz’s reasoning 
deficient. He too notes that Hox genes don’t code for the proteins out of which 
body parts are made. It follows, he insists, that mutations in Hox genes cannot by 
themselves build new body parts or body plans. As he explains, “Schwartz 
ignores the fact that homeobox genes are selector genes. They can do nothing if 
the genes regulated by them are not there.”?! Though Schwartz says he has 
“marveled” at “the importance of homeobox genes in helping us to understand 
the basics of evolutionary change,”°* Szathmary doubts that mutations in these 
genes have much creative power. After asking whether Schwartz succeeds in 
explaining the origin of new forms of life by appealing to mutations in Hox 
genes, Szathmary concludes, “I’m afraid that, in general, he does not.”23 

Nor, of course, do Hox genes possess the epigenetic information necessary for 
body-plan formation. Indeed, even in the best of cases mutations in Hox genes 
still only alter genes. Mutations in Hox genes can only generate new genetic 
information in DNA. They do not, and cannot, generate epigenetic information. 

Instead, epigenetic information and structures actually determine the function 
of many Hox genes, and not the reverse. This can be seen when the same Hox 
gene (as determined by nucleotide sequence homology) regulates the 
development of different anatomical features found in different phyla. For 
instance, in arthropods the Hox gene Distal-less is required for the normal 
development of jointed arthropod legs. But in vertebrates a homologous gene 
(e.g., the Dix gene in mice) builds a different kind of (nonhomologous) leg. 
Another homologue of the Distal-less gene in echinoderms regulates the 
development of tube feet and spines—anatomical features classically thought not 
to be homologous to arthropod limbs, nor to limbs of tetrapods.** In each case, 
the Distal-less homologues play different roles determined by the higher-level 
organismal context. And since mutations in Hox genes do not alter higher-level 
epigenetic contexts,*° they cannot explain the origin of the novel epigenetic 
information and structure that establishes the context and that is necessary to 


building a new animal body plan.*° 


NEUTRAL OR NONADAPTIVE EVOLUTION MICHAEL LYNCH, A 
GENETICIST AT INDIANA UNIVERSITY, HAS OFFERED A DIFFERENT 
MECHANISM OF EVOLUTIONARY CHANGE, AND A DIFFERENT 
EXPLANATION FOR THE ORIGIN (OR GROWTH) OF THE GENOME AS 
WELL AS THE ORIGIN OF ANATOMICAL NOVELTY. LYNCH PROPOSES 
A NEUTRAL OR “NONADAPTIVE” THEORY OF EVOLUTION IN WHICH 
NATURAL SELECTION PLAYS A LARGELY INSIGNIFICANT ROLE. HIS 
THEORY IS BASED ON CONTRASTING OBSERVATIONS ABOUT THE 
FEATURES AND STRENGTH OF EVOLUTIONARY MECHANISMS AT 
WORK IN POPULATIONS OF DIFFERENT SIZES. 


He observes, first, that in general, the larger the population of organisms, the 
lower the mutation rate and (in sexually reproducing eukaryotes) the higher the 
rate of genetic recombination. He notes that the genomes of organisms in larger 
populations (such as those of bacteria and unicellular eukaryotic organisms) tend 
to be smaller and more streamlined—meaning they have fewer intervening 
nonprotein-coding sequences (i.e., introns). Most important, he notes that in 
large populations, natural selection tends to be relatively effective in eliminating 
deleterious mutations and fixing beneficial ones, whereas the process of genetic 
drift (the tendency for gene variants to be lost through random processes) plays a 
relatively less significant role. 

By contrast, Lynch observes that small populations—which would include 
almost all animal groups—are characterized by higher mutation rates and lower 
rates of genetic recombination. They also tend to have large genomes with a lot 
of nonprotein-coding DNA—introns, pseudogenes, transposons and various 
repetitive DNA elements—as well as gene duplicates. Lynch argues that in these 
small populations, natural selection tends to be weak—unable to remove mildly 
deleterious mutations or to fix mildly beneficial ones efficiently. As Lynch 
summarizes, “Three factors (low population sizes, low recombination rates and 
high mutation rates) conspire to reduce the efficiency of natural selection with 
increasing organism size.”°’ Consequently, nonprotein-coding elements are not 
removed from the genome, but instead tend to accumulate, causing the genomes 
of organisms living in small populations to grow—even though these sequences 
may be neutral or even deleterious. Moreover, in small populations, “neutral” 
processes such as random mutation, genetic recombination, and genetic drift 


predominate in their effects over natural selection. 

What does all this have to do with the origin of animals and the Cambrian 
explosion? Evolutionary biologists think that the ancestral groups of the 
Cambrian animals would likely have existed in relatively small populations. 
Lynch argues that in small populations, animal genomes will inevitably grow 
over time as nonprotein-coding sections of DNA (as well as gene duplicates) 
accumulate due to the weakness of natural selection. He thinks these neutral 
mutations drive the evolution and growth of genomic and phenotypic complexity 
in animals. In short, Lynch attempts to explain the expansion of the genome and 
the origin of anatomical complexity as the result of neutral, nonadaptive 
processes of genetic accretion, rather than as an adaptive process involving 
natural selection acting on random mutations. As he states, “Many of the unique 
complexities of the eukaryotic gene arose by semi-neutral processes with little, if 
any, direct involvement of positive selection.”°° 

In his work, Lynch has also advanced a powerful mathematical critique of the 
efficacy of the neo-Darwinian mechanism. He has argued that natural selection 
plays a lesser role in shaping the features of evolving populations than many 
evolutionary theorists have previously assumed—especially in the case of 
relatively small populations. Lynch has argued, instead, that random 
environmental factors—an organism being in the right place at the right time 
(near an abundant food source, for example) or in the wrong place at the wrong 
time (in a drought-stricken region or near an erupting volcano, for example)— 
will play a more important role in determining reproductive success than 
variations in the fitness of organisms within the same population. 


COUNTERINTUITIVE APPROACH LYNCH DEVELOPS HIS 
MATHEMATICAL CRITIQUE OF THE CREATIVE POWER OF NATURAL 
SELECTION BASED ON THE PRINCIPLES OF POPULATION GENETICS. 

NEVERTHELESS, IT DOES NOT FOLLOW FROM HIS ANALYSIS 
SHOWING THE WEAKNESS OF NATURAL SELECTION THAT NEUTRAL 
PROCESSES ALONE ARE SUFFICIENT TO BUILD NEW FUNCTIONAL 
GENES AND PROTEINS. NOR DOES IT FOLLOW THAT NEUTRAL 
PROCESSES ALONE CAN ACCOUNT FOR THE MANY COMPLEX 
ANATOMICAL SYSTEMS THAT REQUIRE NEW SOURCES OF GENETIC 
(AND EPIGENETIC) INFORMATION FOR THEIR CONSTRUCTION. 
INDEED, IT MAY SEEM COUNTERINTUITIVE, AT LEAST FROM A NEO- 
DARWINIAN POINT OF VIEW, TO THINK THAT THE ACCUMULATION 


OF RANDOM MUTATIONS ALONE CAN ACCOMPLISH WHAT NEO- 
DARWINISTS HAVE LONG INVOKED BOTH MUTATIONS AND 
NATURAL SELECTION TO DO. IN EFFECT, LYNCH’S THEORY 

ATTEMPTS TO EXPLAIN THE ORIGIN OF ANATOMICAL COMPLEXITY 
BY REFERENCE TO WHAT WOULD SEEM ON ITS FACE TO BE A LESS 
—NOT A MORE—POTENT MECHANISM THAN THE ONE OFFERED BY 
NEO-DARWINISM. COULD SUCH A COUNTERINTUITIVE THEORY BE 
CORRECT? 


Perhaps, but as a comprehensive theory of how biological information and 
anatomical complexity arises, Lynch’s neutral theory leaves much to be desired. 

In the first place, Lynch’s theory offers no explanation for some of the crucial 
molecular machinery present in eukaryotes—machinery that is necessary to 
rendering his mechanism for the accumulation and subsequent expression of 
genetic information credible. Recall that Lynch thinks that small populations of 
multicellular organisms in particular would have inevitably accumulated many 
insertional genomic elements. But for the functional information in these 
growing genomes to be expressed, the cell must have some way of excising the 
nonfunctional randomly accreting genetic elements—at least, until some of them 
mutate to the point that they contribute to producing functional genes and 
proteins. 

Extant eukaryotic organisms depend on a sophisticated molecular machine 
called a spliceosome—a machine that excises introns and fuses together exons 
(the portions of the genome that code for proteins) before gene expression takes 
place. “This large complex,” observes cell biologist Melissa Jurica, is 
“composed of over 150 individual proteins” and several structural RNAs, and 
thus “may indeed deserve the moniker ‘the most complicated macromolecular 
machine in the cell.’ ”°? 

So where do spliceosomes, and the genes necessary to produce them, come 
from? Lynch doesn’t say, though he recognizes, of course, the importance of this 
molecular machinery to gene expression and to his scenario. As he explains, 
“The problem is that introns are inside genes and get transcribed to mRNA but 
then have to be spliced out perfectly. If you’re one nucleotide off, you get a dead 
transcript.”4° Nevertheless, Lynch’s theory presupposes, but does not explain, 
the origin of the genetic information necessary to produce the spliceosomes that 
perform this function. He certainly does not explain the origin of these massive 
multi-protein, multi-RNA complexes by reference to any neutral evolutionary 


process. Nor can he, since his theory of genomic accretion and expression 
presupposes the existence of precisely such intricate machines. Instead, as my 
colleague Paul Nelson has put it rather colorfully, “to get Lynch’s theory of 
genomic accretion up and running, a great deal of complicated molecular 
machinery must be rolled in from offstage.” 

Of course, it could be argued that these machines and systems arose much 
earlier with the origin of the eukaryotic cell as the result of selection-driven 
evolution in the large populations of simpler unicellular organisms in which, 
according to Lynch’s theory, natural selection played a more significant role. 
Nevertheless, Lynch does not make that argument—and for good reason. Most 
evolutionary biologists today recognize the origin of the eukaryotic cell as a 
completely unsolved problem—unexplained by either neutral or adaptive 
theories of evolution.*! 

Of course, insofar as these molecular machines are present in even one-celled 
eukaryotic organisms, they would have arisen, presumably, well before the 
origin of animals. Thus, explaining their origin is not, strictly speaking, directly 
relevant to explaining the Cambrian explosion. Nevertheless, Lynch’s inability to 
account for their origin reflects directly on the credibility of his theory—at least 
insofar as it seeks to offer a comprehensive account of the mechanisms by which 
biological information and complexity arise during the history of life. 


Drifting In and Out In any case, there are good reasons to doubt that Lynch’s 
neutral mechanism could generate the novel biological information and form 
necessary to explain the origin of animals, even granting the prior existence of 
the molecular machinery (in small populations of eukaryotic organisms) that his 
scenario requires. 


First, Lynch assumes a false gene-centric view of the origin of biological 
form. As he writes: “Most of the phenotypic diversity that we perceive in the 
natural world is directly attributable to the peculiar structure of the eukaryotic 
gene.”** His view overlooks the crucial role of epigenetic information and 
structure in the origin of animal form discussed in Chapter 14 and, therefore, 
does nothing to explain its origin. 

Second, neutral processes such as genetic drift do not favor beneficial 
mutations, and thus do not fix, with any efficiency, those mutation-induced 
genetic traits in small populations.*° Natural selection, as we saw in Chapter 10, 
is something of a double-edged sword. On the one hand, natural selection helps 


to fix beneficial traits in a population. On the other hand, natural selection also 
makes it difficult for functional genes to vary widely without being eliminated. 
Neutral theories of evolution attempt to avoid the latter problem by invoking 
gene duplication and other processes that can add nonfunctional sequences to the 
genome—sequences that are unaffected, at least initially, by selective pressure. 
In so doing, however, these theoretical formulations significantly diminish the 
role of natural selection as a mechanism that can fix beneficial mutations in 
place once they have arisen. Thus, in all neutral theories, including Lynch’s, any 
beneficial mutations that arise and begin to drift through a population, can just as 
readily—without a significant influence from natural selection to impede it— 
drift out of a population as well. This limitation vastly increases the time it will 
take for neutral processes to fix beneficial genetic changes in a population. Both 
skeptics and proponents of neo-Darwinism have recognized this deficiency in 
Lynch’s model.# 

Third, and most important, Lynch’s theory not only fails to account for the 
fixation of new genes and traits in small populations, it also fails to account for 
their origin. Lynch’s mechanism of neutral mutational change does envision the 
addition of brute genomic complexity as the result of the accretion of preexisting 
genetic elements (introns, transposons, pseudogenes and gene duplicates). 
Nevertheless, the addition of these elements does not generate any novel 
functional (or specified) genetic information. Instead, it merely transfers 
preexisting genetic sequences from one organismal context where those 
sequences may have performed a function, to another where they likely will not. 
Indeed, the point of neutral theory is to postulate the addition of genetic elements 
that, initially, do not perform crucial functions such that they can experience 
mutations without deleterious consequence to the organism. Lynch himself 
assumes that these added elements will not perform functions in their new 
context, which is why he envisions the need for spliceosomes to excise them, at 
least initially. 

Instead, for Lynch’s theory to explain the origin of new and functional genes 
and proteins (and the anatomical complexities that depend on them), his theory 
would have to solve the problem of combinatorial inflation discussed in Chapter 
10. He would have to show that purely random mutations could efficiently 
search the relevant combinatorial space of possible sequences corresponding to a 
given novel functional gene or protein. 

Nevertheless, Lynch does not even address the problem of combinatorial 
inflation or the closely related problem of the rarity of genes and proteins in 


sequence space. He provides no experimental evidence that recombination 
and/or mutation (given genetic drift) will actually produce functional or specified 
genetic complexity. Instead, the examples he provides are entirely hypothetical. 
In addition, he offers no reason to think that the probability of a successful 
search for functional genes or proteins would be any higher (i.e., more likely to 
occur) than the probabilities calculated in Chapter 10. He does not, therefore, 
answer the challenge of the problem of combinatorial inflation and the rarity of 
functional genes and proteins in sequence space. 

Lynch does provide, perhaps, a more detailed characterization than other 
neutral theories of where neutral, nonadaptive processes must predominate. 
Nevertheless, he does not show that such processes—random genetic mutations 
unhinged from natural selection—are sufficient to generate novel functional 
genes and proteins, let alone complex anatomical novelties requiring the origin 
of many such genes and proteins. Instead, as Axe’s experimental results have 
shown, random mutations of whatever kind will not generate enough trials to 
render probable (or plausible) a successful search of the sequence space 
corresponding to a given functional gene or protein. 


LYNCH AND WAITING TIMES LYNCH DOES ARGUE IN ONE PAPER 
THAT NEUTRAL EVOLUTIONARY PROCESSES CAN GENERATE NEW 
COMPLEX ADAPTATIONS—ADAPTATIONS REQUIRING MULTIPLE 
COORDINATED MUTATIONS—WITHIN REALISTIC WAITING TIMES. IN 
PARTICULAR, WRITING IN A RECENT PAPER WITH COLLEAGUE 
ADAM ABEGG OF ST. LOUIS UNIVERSITY, HE ARGUES THAT 
“CONVENTIONAL POPULATION GENETIC MECHANISMS” SUCH AS 
RANDOM MUTATION AND GENETIC DRIFT CAN CAUSE THE 
“RELATIVELY RAPID EMERGENCE OF SPECIFIC COMPLEX 
ADAPTATIONS.”*° LYNCH MAKES TWO SPECIFIC CLAIMS IN THIS 
REGARD. FIRST, HE CLAIMS THAT IN LARGE POPULATIONS, 
ARBITRARILY COMPLEX ADAPTATIONS CAN OCCUR IF THE 
MUTATIONAL INTERMEDIATES ARE NEUTRAL IN THEIR EFFECTS ON 
THE ORGANISM. THAT IS, LYNCH PURPORTS TO SHOW THAT IN 
LARGE MICROBIAL POPULATIONS, COMPLEX ADAPTATIONS 
REQUIRING A VIRTUALLY UNLIMITED NUMBER OF MUTATIONS CAN 
OCCUR WITHIN REALISTIC WAITING TIMES. ACCORDING TO LYNCH, 
THIS CAN OCCUR PROVIDED THAT EACH MUTATION IN A SERIES OF 
MUTATIONS HAS NEUTRAL (BUT NOT DELETERIOUS) EFFECTS ON 


THE ORGANISM. SECOND, LYNCH ARGUES THAT EVEN THOUGH IT 
GENERALLY TAKES LONGER TO BUILD COMPLEX TRAITS IN SMALL 
POPULATIONS, SUCH TRAITS CAN STILL EVOLVE WITHIN REALISTIC 

WAITING TIMES PROVIDED, AGAIN, THAT THE MUTATIONAL 
INTERMEDIATES ARE NEUTRAL IN THEIR EFFECTS. IN FACT, HE 
CONCLUDES THAT “THE ELEVATED POWER OF BOTH RANDOM 

GENETIC DRIFT AND MUTATION MAY ENABLE THE ACQUISITION OF 
COMPLEX ADAPTATIONS IN MULTICELLULAR SPECIES AT RATES 
THAT ARE NOT APPRECIABLY DIFFERENT FROM THOSE 
ACHIEVABLE IN ENORMOUS MICROBIAL POPULATIONS.”“° 


Though Lynch makes these assertions in the context of a densely 
mathematical scientific article, the significance of his claims, if true, can hardly 
be overstated. In essence, he claims that his population genetics—based 
mathematical model shows that purely random mutations and genetic drift can 
generate extremely complex adaptations in realistic waiting times—that his 
neutral evolutionary theory solves the problem of complex adaptations and long 
expected waiting times discussed in Chapter 12. 

But some things are just too good to be true, and it turns out that Lynch and 
Abegg made a subtle but fundamental mathematical error in coming to their 
conclusion. Appropriately, perhaps, the first person to demonstrate that Lynch’s 
incredible claim was problematic was Douglas Axe. Although Axe could see 
that much of the math in Lynch’s paper with Abegg was correct, Axe suspected 
from his own calculations and experiments that they had made some crucial 
error. In the end, he traced Lynch and Abegg’s claims to two erroneous 
equations, both of which were based on an erroneous assumption. In essence, 
Lynch and Abegg assumed that organisms will acquire a given complex 
adaptation by traversing a direct path to the new anatomical structure. Each 
mutation would build on the previous one in the most efficient manner possible 
—with no setbacks, false starts, aimless wandering, or genetic degradation— 
until the desired structure or system (or gene) is constructed. Thus, they 
formulated an undirected model of evolutionary change, and one that assumes, 
moreover, that there is no mechanism available (such as natural selection) that 
can lock in potentially favorable mutational changes on the way to some 
complex advantageous structure. Nevertheless, they calculated the waiting times 
required to produce such structures as if a process for locking in potentially 
advantageous changes did exist, and as if their undirected and purely random 


mechanism was in some way directed to these functionally propitious outcomes. 
As Axe notes in a trenchant mathematical critique of Lynch and Abegg’s 
argument, “Of all the possible evolutionary paths a population can take, the 
analysis of Lynch and Abegg considers only those special paths that lead directly 
to the desired end—the complex adaptation.”*7 

Yet nothing in Lynch’s neutral model ensures that potentially advantageous 
mutations will remain in place while other mutations accrue. As Axe explains, 
“Productive changes cannot be ‘banked’, whereas Equation 2 [one of Lynch’s 
equations] presupposes that they can.”4° Instead, Axe shows, mathematically, 
that degradation (the fixation of mutational changes that make the complex 
adaptation less likely to arise) will occur much more rapidly than constructive 
mutations, causing the expected waiting time to increase exponentially. 

The illustration I developed in Chapter 10 to explain the problem facing 
neutral models of gene evolution may help illuminate the mistaken assumption 
underlying Lynch and Abegg’s calculations. Recall from that chapter that my 
hypothetical blindfolded man dropped in the middle of an enormous shark-free 
pool did not face any predators (by analogy the purifying effects of natural 
selection). But he still faced the problem of finding the ladder at the edge of the 
enormous body of water (by analogy, the need to search an enormous number of 
possible mutational paths and possible sequences to find the rare functional 
ones). Now suppose someone were to calculate how long on average it would 
take for the blindfolded man to swim to the ladder at the edge of the enormous 
pool. If someone simply divided the distance to the ladder at the edge of the pool 
by the maximum speed that man could swim, he or she would get a fantastically 
optimistic estimate of the severity of the problem facing our unfortunate 
swimmer. Why? Because calculating the probable waiting time in this manner 
would overlook the main problem the man faced, namely, the man does not 
know where the ladder is or how to get there. Nor does he have any way to 
gauge his progress. 

Thus, any realistic estimate of how long it will actually take him to swim to 
the ladder—as opposed to an estimate of the theoretically fastest route possible 
—must take into account his probable aimless wandering, fits and starts, 
swimming in circles and drifting in various directions. Similarly, Lynch and 
Abegg fail to reckon in their calculation on the random, undirected and, literally, 
aimless nature of the mechanism that they propose. Instead, they mistakenly 
assume that neutral processes of evolution will make a beeline for some specific 
complex adaption. In fact, these processes will—in all probability—also wander 


aimlessly in a vast sequence space of neutral, functionless possibilities with 
nothing to direct them, or preserve them in any forward progress they happen to 
make, toward the rare and isolated islands of function represented by complex 
adaptations. For this reason, Lynch vastly underestimates the waiting times 
required to generate complex adaptations and, therefore, does not solve the 
problem of the origin of genes and proteins or any other complex adaptation. 
Instead, Axe shows in his own mathematical model, which accompanies his 
critique of Lynch, that the waiting times problem is just as severe as he (Axe) 
had previously calculated. 


NEO-LAMARCKIAN EPIGENETIC INHERITANCE THE THIRD 
ELEMENT OF THE NEO-DARWINIAN TRIAD CONCERNS THE 
TRANSMISSION AND INHERITANCE OF GENETIC INFORMATION. NOT 
SURPRISINGLY, A NEW VERSION OF EVOLUTIONARY THEORY 
QUESTIONS THE NEO-DARWINIAN UNDERSTANDING OF HEREDITY 
AS WELL. 


Darwin himself lacked an accurate theory of how features of organisms are 
transmitted from one generation to the next. He thought that changes in 
organisms that occurred during their lifetimes, as the result of the “use and 
disuse” of different organs and anatomical systems, would be transmitted to 
offspring through reproduction.*? In this respect, his theory of inheritance 
resembled that of an earlier evolutionary theorist, Jean-Baptiste de Lamarck 
(1744-1829), who also believed in the inheritance of acquired characteristics. 

Lamarckian mechanisms, although unsupported by any evidence at the time, 
came to play an increasingly important role in Darwin’s thinking, as criticisms of 
natural selection caused Darwin to place more weight on the direct influence of 
the environment in evolutionary change. Indeed, by the sixth edition of the 
Origin (1872), Darwin specifically emphasized the importance of these modes of 
inheritance.” 

But with the rediscovery of Mendel’s laws in 1900 and the identification of 
chromosomes as the material entity responsible for the transmission of 
inheritance, Lamarckian theories of inheritance fell out of favor. Following the 
rise of the neo-Darwinian synthesis, most evolutionary biologists came to regard 
the gene as the locus of all heritable change in the organism. And after 1953, 
biologists equated the gene with specifically arranged nucleotide bases within 
the DNA molecule. 


Recently, however, as more biologists have recognized that some biological 
information—epigenetic information—resides in structures outside of DNA, 
interest has grown in the possibility that these nongenetic sources of information 
may influence the course of evolution. The discovery that epigenetic information 
can be altered and directly inherited independently of DNA has attracted further 
attention. This discovery has, in turn, led to the formulation of a contemporary 
“neo-Lamarckian””! theory that envisions changes in the nongenetic structures 
of an organism affecting subsequent generations. 

Today, prominent defenders of neo-Lamarckism include Eva Jablonka, of Tel 
Aviv University, and Massimo Pigliucci, of the City University of New York. 
Lamarck, of course, knew nothing about the role of genes and believed that 
inheritance of acquired characteristics was an important driving force in 
evolution. Modern neo-Lamarckians, fully apprised of the reality of genetic 
inheritance, nevertheless think that nongenetic sources of information and 
structure may play some role in the evolution of biological form. According to 
Jablonka, neo-Lamarckism “allow[s] evolutionary possibilities denied by the 
‘Modern Synthesis’ version of evolutionary theory, which states that variations 
are blind, are genetic (nucleic acid—based), and that saltational events do not 
significantly contribute to evolutionary change.”°* 

Jablonka has collected several categories of evidence in support of what she 
calls “epigenetic inheritance systems.” In the first place, in some single-celled 
organisms (such as E. coli and yeast) environmentally induced changes in 
metabolic pathways can be transmitted to the next generation independently of 
any changes in the cell’s DNA. Second, she notes that structural information 
mediating organismal form (and function) does pass from parent to offspring 
independently of DNA, via membranes and other three-dimensional cellular 
patterns. 

Third, she discusses the process of DNA methylation—a process in which 
special enzymes attach a methyl group (CH3) to nucleotide bases within the 


double helix. Processes like this can alter gene regulation and chromatin 
structure. Jablonka notes that the changes produced by processes that alter gene 
regulation are often transmitted to subsequent generations of cells without any 
changes to DNA base sequences. Finally, she cites a process called “RNA- 
mediated” epigenetic inheritance, a recently discovered phenomenon. Here, 
small RNAs, again acting in concert with special enzymes, affect gene 
expression and chromatin structure, and these modifications appear to be 
heritable independently of genes. 


Can any of these mechanisms help to explain the origin of animal form in the 
Cambrian explosion? Not really. 

By its nature, macroevolution requires stable—meaning permanently 
heritable—changes. But Jablonka’s evidence shows that where nongenetic 
inheritance occurs in animals, it involves structures that either (a) do not change 
(such as membrane patterns and other persistent templates of structural 
information), or (b) do not persist over more than several generations. And 
neither case generates significant evolutionary innovation in animal form. 
Instead, for directional evolutionary change to occur in a population of 
organisms, changes must not only be heritable, but permanent. Stability—the 
irreversible and enduring heritability of traits—is a logically inescapable 
requirement for any theory of evolution. This is precisely what “descent with 
modification” means. 

And here Jablonka’s evidence for stable nongenetic inheritance is equivocal 
at best, as she readily admits. Reviewing Jablonka’s assembled data for animals 
reveals no case where an induced epigenetic change persisted permanently in 
any population. The heritability of such changes is transient, lasting (depending 
on the species in question) from a few generations up to forty. 

Jablonka candidly addresses this lack of evidence for stability, noting, “We 
believe that epigenetic variants in every locus in the eukaryotic genome can be 
inherited, but in what manner, for how long, and under what conditions, has yet 
to be qualified.”°’ Consequently, despite its intriguing aspects, the evolutionary 
significance of neo-Lamarckian epigenetic inheritance remains uncertain or, in 
Jablonka’s own words, “inevitably, somewhat speculative.”°* 


NATURAL GENETIC ENGINEERING UNIVERSITY OF CHICAGO 
GENETICIST JAMES SHAPIRO HAS FORMULATED ANOTHER POST- 
DARWINIAN PERSPECTIVE ON HOW EVOLUTION WORKS THAT HE 

CALLS “NATURAL GENETIC ENGINEERING.” SHAPIRO HAS 

DEVELOPED AN UNDERSTANDING OF EVOLUTION THAT TAKES 
ACCOUNT OF THE INTEGRATED COMPLEXITY OF ORGANISMS AS 

WELL AS THE IMPORTANCE OF NONRANDOM MUTATIONS AND 

VARIATIONS IN THE EVOLUTIONARY PROCESS. 


He observes that organisms within a population often modify themselves in 
response to different environmental challenges. He cites evidence showing that 
when populations are challenged by environmental stresses, signals, or triggers, 


organisms do not generate mutations or make genetic changes randomly, that is, 
without respect to, or unguided by, their survival needs. Instead, they often 
respond to environmental stresses or signals by inducing mutations in a directed 
or regulated way. As he explains, “The continued insistence on the random 
nature of genetic change by evolutionists should be surprising for one simple 
reason: empirical studies of the mutational process have inevitably discovered 
patterns, environmental influences, and specific biological activities at the roots 
of novel genetic structures and altered DNA sequences.””? 

The depth of Shapiro’s challenge to orthodox neo-Darwinism is profound. He 
rejects the randomness of novel variation that Darwin himself emphasized and 
that neo-Darwinian theorists throughout the twentieth century have reaffirmed.°° 
Instead, he favors a view of the evolutionary process that emphasizes 
preprogrammed adaptive capacity or “engineered” change, where organisms 
respond in a “cognitive” way to environmental influences, rearranging or 
mutating their genetic information in regulated ways to maintain viability. 

As an example, Shapiro notes that—contrary to the neo-Darwinian 
assumption that “DNA alterations are accidental”°’—all organisms possess 
sophisticated cellular systems for proofreading and repairing their DNA during 
its replication. He notes that these systems are “equivalent to a quality-control 
system in human manufacturing,” where the “surveillance and correction” 
functions represent “cognitive processes, rather than mechanical precision.”°® 

As an example of regulated mutation, Shapiro observes that in response to 
environmental assault—UV damage from sunlight or the presence of an 
antibiotic, for instance—bacteria activate what is known as the “SOS response” 
system. This system makes use of specialized error-prone DNA polymerases, 
normally left unexpressed, that are synthesized and set into action, allowing the 
population to generate a much wider range of genetic variation than usual. 
Bacterial cells regulate this process using a DNA-binding protein known as 
LexA, which normally represses the error-prone polymerases. When the SOS 
system is activated by environmental damage, the production of LexA first drops 
dramatically, allowing expression of the error-prone polymerases, but then rises, 
which “ensures that as soon as DNA repair occurs . . . LexA [will] reaccumulate 
and repress the SOS genes.”°? This system allows cells to “replicate DNA that 
carries unrepaired damage,” keeping their essential replication machinery 
moving past a stall, in the absence of which the bacterium would die. 

An analogy may help to illustrate what the cell is doing when confronted with 
an environmental challenge. Imagine a military unit, a combined armor and 


infantry battalion, crossing an open plain. Suddenly, the battalion falls under a 
fierce, unrelenting enemy artillery barrage, wounding many of its soldiers. To 
keep the wounded alive until the barrage ceases or reinforcements arrive, the 
commander instructs certain members of the unit with destructive skills to 
disassemble (in military jargon, “cannibalize”) a few of the tanks to provide 
temporary armored cover from further incoming shells. His order tells them, 
however, to cease their tank-modifying actions as soon as the barrage ends. That 
is, the unit as a whole tolerates “damage” to some of its equipment to save as 
many of its members as possible. 

In the same way, while at one level their “error-prone” role may appear 
counterintuitive, these mutation-generating DNA polymerases of the SOS 
system actually constitute essential hardware in the cell’s defensive armory.°! 
From Shapiro’s perspective, this survival strategy does not exemplify Darwinian 
randomness, but rather sophisticated preprogramming, an “apparatus that even 
the smallest cells possess” to maintain viability.°* What’s more, the carefully 
regulated expression of the SOS response provides evidence that cells employ 
the system only when needed. 

In addition to ramping up their mutation rates in specific sections of the 
genome, cells may also change the way they express the genetic information that 
they already carry, expressing some genes that were previously unexpressed and 
suppressing others. Organisms in populations under particular stresses may 
retrieve and access modular elements of genetic information stored in disparate 
locations on the genome or even on different chromosomes. Cells will then 
assemble, or concatenate, those modular elements to form a new gene or RNA- 
transcript capable of directing the synthesis of a novel protein or proteins that 
can help the organism survive. 

Shapiro argues that these and other kinds of directed, rather than random, 
genetic changes and responses to stimuli occur under “algorithmic control.” He 
describes the cell as “a powerful real-time distributed computing system” 
implementing various “if-then” subroutines. This emphatically challenges one of 
the three key elements of the neo-Darwinian triad: the claim that mutations and 
variations occur in a strictly random way. 

During the last fifteen years, Shapiro has published a series of fascinating 
papers about the newly discovered capacities of cells to direct or “engineer” the 
genetic changes they need to remain viable in a range of environmental 
conditions. His work represents a promising avenue of new biological research, 
bringing insight into how the cell’s information-processing system modifies and 


directs the expression of its genetic information in real time in response to 
different signals. Shapiro’s work also provides new insights into how observable 
evolutionary changes occur in living populations. 

Could it, then, also provide a solution to the problem of the origin of the 
information necessary to build an animal body plan? It could, except for one 
question that Shapiro’s otherwise brilliant characterization of how organisms 
modify themselves doesn’t address. 

Where does the programming come from that accounts for the 
“preprogrammed adaptive capacity” of living organisms? If, as James Shapiro 
argues, natural selection and exclusively random mutations don’t produce this 
information-rich preprogramming, then what did? In the next chapters, Ill 
propose an answer to precisely this question. 


17 
The Possibility of Intelligent Design 


The owner of a remote island estate has been murdered while out riding. When 
the local sheriff arrives, he learns there are several obvious suspects: the volatile 
gamekeeper, the owner of a neighboring estate with whom the murder victim has 
had a long-running feud, and the estate owner’s estranged wife, who had been 
living on the island in a small mother-in-law cottage. The sheriff quickly learns 
the basic facts of the case. The victim was found dead, facedown on the beach, 
with his horse standing nearby. Any one of the three suspects could have taken a 
rifle, from an unlocked shed at the edge of the property. All were healthy enough 
to have hiked to the scene of the crime. Each of them has a motive. And none 
has an alibi. 

But as the investigation unfolds additional facts come to light. Most 
importantly, when the coroner arrives, he determines that although the victim 
was shot in the stomach and then his head was harshly bludgeoned by the butt of 
the rifle, these injuries served merely to conceal the bullet wound that actually 
killed the estate owner. The man was dead when he hit the ground. What killed 
him was a perfect shot entering the head just behind the right ear, exactly where 
an expert marksman would place a bullet. Moreover, ballistics shows that this 
bullet came from a different gun altogether from the one stored in the shed, a 
weapon likely fired from quite a distance. 

The sheriff then returns to the list of suspects and, one by one, eliminates 
them. Abundant evidence shows that none of the three prime suspects is a 
particularly good shot, much less a world-class marksman. The landowner’s 
estranged wife has a shaky hand and no experience with firearms. The volatile 
gamekeeper has extremely poor eyesight. And the neighboring landowner turns 
out to have an alibi after all—as well as a broken arm, which would have 
prevented him from holding the kind of rifle from which the bullet was fired. 
There is, however, one other person living on the estate, though not even the 
other suspects suspect him. He is the victim’s loyal and longtime personal 


assistant, a timorous older man much beloved by both the family and the other 
servants. No one wants to consider him as a possible suspect. But is it possible 
that he could have had something to do with the crime after all? Might an 
unexpected suspect—indeed “the butler”—have done it? 

Clearly, standard evolutionary theory has reached an impasse. Neither neo- 
Darwinism nor a host of more recent proposals (punctuated equilibrium, self- 
organization, evolutionary developmental biology, neutral evolution, epigenetic 
inheritance, natural genetic engineering) have succeeded in explaining the origin 
of the novel animal forms that arose in the Cambrian period. Yet all these 
evolutionary theories have two things in common: they rely on strictly material 
processes, and they also have failed to identify a cause capable of generating the 
information necessary to produce new forms of life. 

This raises a question. Is it possible that a different or unexpected kind of 
cause might provide a more adequate explanation for the origin of the new form 
and information—as well as the other distinctive features—present in the 
Cambrian explosion? In particular, is it possible that intelligent design—the 
purposeful action of a conscious and rational agent—might have played a role in 
the Cambrian explosion? 


INTRODUCING INTELLIGENT DESIGN WHEN THE CASE FOR 
INTELLIGENT DESIGN IS MADE, IT’S OFTEN HARD TO GET 
CONTEMPORARY EVOLUTIONARY BIOLOGISTS TO SEE WHY SUCH 
AN IDEA SHOULD EVEN BE CONSIDERED OR WHY DISCUSSIONS OF 
DESIGN SHOULD PLAY ANY ROLE IN BIOLOGY AT ALL. THOUGH 
MANY BIOLOGISTS NOW ACKNOWLEDGE SERIOUS DEFICIENCIES IN 
CURRENT STRICTLY MATERIALISTIC THEORIES OF EVOLUTION, 
THEY RESIST CONSIDERING ALTERNATIVES THAT INVOLVE 
INTELLIGENT GUIDANCE, DIRECTION, OR DESIGN. 


Much of this resistance seems to come simply from not understanding what 
the theory of intelligent design is. Many evolutionary biologists see intelligent 
design as a religiously based idea—a form of biblical creationism. Others think 
the theory denies all forms of evolutionary change. But contrary to media 
reports, intelligent design is not a biblically based idea, but instead an evidence- 
based theory about life’s origins—one that challenges some, but not all, 
meanings of the term “evolution.” 

Perhaps the best way to explain the theory of intelligent design is to contrast 


it with the specific aspect of the theory of Darwinian evolution that it directly 
challenges. Recall from our opening discussion in Chapter 1 that the term 
“evolution” has many different meanings and that Darwin’s theory of evolution 
by natural selection affirmed several of them: first, change over time; second, 
universal common descent; and third, the creative power of natural selection 
acting on random variations. In affirming this third meaning of evolution, both 
classical Darwinism and modern neo-Darwinism also affirm what neo-Darwinist 
Richard Dawkins has called the “blind watchmaker” hypothesis. This hypothesis 
holds that the mechanism of natural selection acting on random genetic 
variations (and mutations) can produce not just new biological form and 
structure, but also the appearance of design in living organisms.! 

Darwin argued for this idea in The Origin of Species as well as in his letters. 
Recall the sheep breeding illustration from Chapter 1 where I described how 
both intelligent human breeders and environmental change (a series of bitterly 
cold winters) might produce an adaptive advantage in a population of sheep. 
During the nineteenth century, biologists regarded the adaptation of organisms to 
their environment as one of the most powerful pieces of evidence of design in 
the living world. By observing that natural selection had the power to produce 
such adaptations, Darwin not only affirmed that his mechanism could generate 
significant biological change, but that it could explain the appearance of design 
—without invoking the activity of an actual designing intelligence. In doing so, 
he sought to refute the design hypothesis by providing a materialistic 
explanation for the origin of apparent design in living organisms. Modern neo- 
Darwinists also affirm that organisms look as if they were designed. They also 
affirm the sufficiency of an unintelligent natural mechanism—mutation and 
natural selection—as an explanation for this appearance. Thus, in both 
Darwinism, and neo-Darwinism, the selection/variation (or selection/mutation) 
mechanism functions as a kind of “designer substitute.” As the late Harvard 
evolutionary biologist Ernst Mayr explains: “The real core of Darwinism . . . is 
the theory of natural selection. This theory is so important for the Darwinian 
because it permits the explanation of adaptation, the ‘design’ of the natural 
theologian, by natural means.”* Or as another prominent evolutionary biologist, 
Francisco Ayala, has put it succinctly, natural selection explains “design without 
a designer.”° 

Other contemporary neo-Darwinian biologists including Richard Dawkins, 
Francis Crick, and Richard Lewontin have also emphasized that biological 
organisms only appear to have been designed.* They recognize that many 


biological structures—whether the chambered nautilus, the compound eye of a 
trilobite, the electrical system of the mammalian heart, or numerous molecular 
machines—attract our attention because the sophisticated organization of such 
systems is reminiscent of our own designs. Dawkins has noted, for example, that 
the digital information in DNA bears an uncanny resemblance to computer 
software or machine code.” He explains that many aspects of livings systems 
“sive the appearance of having been designed for a purpose.”© 

Nevertheless, neo-Darwinists regard that appearance of design as entirely 
illusory, as did Darwin himself, because they think that purely mindless, 
materialistic processes such as natural selection and random mutations can 
produce the intricate designed-like structures in living organisms. In this view, 
natural selection and random mutation mimic the powers of a designing 
intelligence without themselves being intelligently directed or guided in any 
way. 

That’s where the theory of intelligent design comes into play. Intelligent 
design challenges the idea that natural selection and random mutation (and other 
similarly undirected materialistic processes) can explain the most striking 
appearances of design in living organisms. Instead, it affirms that there are 
certain features of living systems that are best explained by the design of an 
actual intelligence—a conscious and rational agent, a mind—as opposed to a 
mindless, materialistic process. The theory of intelligent design does not reject 
“evolution” defined as “change over time” or even universal common ancestry, 
but it does dispute Darwin’s idea that the cause of major biological change and 
the appearance of design are wholly blind and undirected. 

Nor does the theory seek to insert into biology an extraneous religious 
concept. Intelligent design addresses a key scientific question that has long been 
part of evolutionary biology: Is design real or illusory? Indeed, part of what 
Darwin set out to explain was precisely the appearance of design. With current 
materialistic evolutionary theories now failing to explain many of the most 
striking appearances of design in the Cambrian animals, including the presence 
of digital information as well as other complex adaptations, the possibility 
emerges that these appearances of design may not be just appearances after all. 
The Darwinian formulation of evolutionary theory in opposition to the design 
hypothesis,’ coupled with the inability of neo-Darwinian and other materialistic 
theories to account for salient appearances of design, would seem logically to 
reopen the possibility of actual (as opposed to apparent) design in the history of 
animal life. 


Either life arose as the result of purely undirected material processes or a 
guiding or designing intelligence played a role. Advocates of intelligent design 
favor the latter option and argue that living organisms look designed because 
they really were designed. Design proponents argue that living systems exhibit 
telltale indicators of prior intelligent activity that justify this claim, indicators 
that make intelligent design scientifically detectable from the evidence of the 
living world. 

But that, for many evolutionary biologists, is precisely the rub. Because they 
think of intelligent design as a religiously based idea, they understand that 
people might want to affirm the intelligent design of life as part of their religious 
beliefs—but not as a consequence of scientific evidence. Indeed, most 
evolutionary biologists don’t see how the idea of intelligent design could 
contribute to a scientific explanation of life’s origins, nor do they see how 
intelligent design could ever be detected or inferred scientifically from evidence 
in nature. Exactly how would researchers justify such an inference? 


MY STORY WHEN I LEFT FOR MY GRADUATE STUDIES IN ENGLAND 
IN 1986, TWAS ASKING A SIMILAR SET OF QUESTIONS. AT THAT 
TIME, I WASN’T THINKING ABOUT THE SCIENTIFIC LEGITIMACY OF 
THE INTELLIGENT DESIGN HYPOTHESIS AS AN EXPLANATION FOR 
THE ORIGIN OF ANIMALS. INSTEAD, I WANTED TO KNOW IF 
INTELLIGENT DESIGN COULD HELP EXPLAIN THE ORIGIN OF LIFE 
ITSELF. MY QUESTIONS EVENTUALLY LED ME TO LEARN ABOUT A 
DISTINCTIVE METHOD OF HISTORICAL SCIENTIFIC INQUIRY. THAT 
DISCOVERY LED ME TO A METHOD OF REASONING THAT ALLOWS 
FOR THE DETECTION OR INFERENCE OF PAST CAUSES, INCLUDING 
INTELLIGENT CAUSES. 


A year earlier, in 1985, I had met one of the first contemporary scientists to 
revive the idea that intelligent design might have played a causal role in the 
origins of life. Chemist Charles Thaxton (see Fig. 17.1) had recently published a 
book, The Mystery of Life’s Origin. His coauthors were polymer scientist and 
engineer Walter Bradley and geochemist Roger Olsen. Their book received 
acclaim as a groundbreaking critique of current theories of chemical evolution. 
They showed that attempts to explain the origin of the first living cell from 
simpler nonliving chemicals had failed and that these theories had specifically 
failed to explain the origin of the information necessary to produce the first life. 


But it was in the book’s epilogue that the three scientists proposed a radical 
alternative. There they suggested that the information-bearing properties of DNA 
might point to the activity of a designing intelligence—to the work of a mind, or 
an “intelligent cause” as they put it. Drawing on the analysis of the British- 
Hungarian physical chemist Michael Polanyi, they argued that chemistry and 
physics alone could not produce the information in DNA any more than ink and 
paper alone could produce the information in a book. Instead, they argued that 
our uniform experience suggests a cause-and-effect relationship between 
intelligent activity and the production of information.’ 





FIGURE 17.1 
Charles Thaxton. Courtesy Charles Thaxton. 


At the time the book appeared, I was working as a geophysicist for an oil 
company in Dallas where Thaxton happened to live. I met him at a scientific 
conference and became intrigued by his work. Over the next year, I began 
dropping by his office to discuss his book and the radical idea he was developing 
about DNA. 

The first part of Thaxton’s argument made sense to me. Experience does 
indeed seem to affirm that (specified or functional) information typically arises 
from the activity of intelligent agents, from minds as opposed to mindless, 
material processes. When a “tweet” appears on your smart phone’s Twitter feed 


(if you’re into that kind of thing), it clearly originated first in the mind of a 
person who created a Twitter account, scripted the “tweet,” and then sent it out 
across the Internet. Information does arise from minds. 

But Thaxton went further. He acknowledged that most branches of science 
didn’t consider intelligent activity as an explanation because, he thought, 
intelligent agents don’t usually generate repeatable or predictable phenomena 
and because they are difficult to study under controlled laboratory conditions. 
Nevertheless, Thaxton argued that scientists might propose an intelligent cause 
as a positive scientific explanation for some events in the past, as part of a 
special mode of scientific inquiry he called origins science. He noted that 
scientific disciplines such as archaeology, evolutionary biology, cosmology, and 
paleontology often infer the occurrence of singular, nonrepeatable events and 
that the methods used to make such inferences could help scientists identify 
positive indicators of intelligent causes in the past as well. 

Here I wasn’t initially so sure. Thaxton’s ideas about a distinctive method of 
science concerned with origins, or at least with the past generally, seemed 
intuitively plausible. After all, evolutionary biologists and paleontologists do 
seem to use a method of investigation different from that employed by 
laboratory chemists. Nevertheless, I wasn’t exactly sure what those methods 
were, how they were different from those used in other sciences, and whether 
using them in any way justified considering intelligent design as a scientific 
hypothesis. 

So the next year when I left Dallas, Texas, for Cambridge, England, to pursue 
my studies in the history and philosophy of science, I had a lot on my mind. Is 
there a distinctive method of historical scientific inquiry? If so, does that method 
of reasoning and investigation justify a scientific reformulation of the design 
hypothesis? In particular, does the intuitive connection between information and 
the prior activity of a designing intelligence justify a positive (historical) 
scientific inference to intelligent design? Does it make intelligent design 
detectable? 


HISTORICAL SCIENTIFIC METHOD AND THE DESIGN 
HYPOTHESIS IN MY RESEARCH, I DISCOVERED THAT HISTORICAL 
SCIENTISTS OFTEN DO MAKE INFERENCES WITH A DISTINCTIVE 
LOGICAL FORM. THIS TYPE OF INFERENCE IS KNOWN TECHNICALLY 
AS AN ABDUCTIVE INFERENCE.!? DURING THE NINETEENTH 
CENTURY, AMERICAN LOGICIAN C. S. PEIRCE CHARACTERIZED THIS 


MODE OF REASONING AND DISTINGUISHED IT FROM TWO BETTER- 
KNOWN FORMS, INDUCTIVE AND DEDUCTIVE REASONING. HE 
NOTED THAT IN INDUCTIVE REASONING, GENERAL RULES ARE 
INFERRED FROM PARTICULAR FACTS, WHEREAS IN DEDUCTIVE 

REASONING, GENERAL RULES ARE APPLIED TO PARTICULAR FACTS 

IN ORDER TO DEDUCE SPECIFIC OUTCOMES. IN ABDUCTIVE 
REASONING, HOWEVER, INFERENCES ARE OFTEN MADE ABOUT 
PAST EVENTS OR CAUSES BASED ON PRESENT CLUES OR FACTS.!! 


To see the difference between these three types of inference, consider the 
following argument forms: Inductive argument: A, is B. 

A, is B. 

A3 is B. 

A, is B. 

A, is B. 

All A’s are B. 


Deductive argument: MAJOR PREMISE: If A has occurred, then B will follow as 
a matter of course. 

MINOR PREMISE: A has occurred. 

CONCLUSION: Hence, B will follow as well. 


Abductive argument: MAJOR PREMISE: If A occurs, then B would be expected 
as a matter of course. 

MINOR PREMISE: The surprising fact B is observed. 

CONCLUSION: Hence, there is reason to suspect that A has occurred. 


Note the difference between deductive and abductive forms of inference. In 
deduction, the minor premise affirms the antecedent variable (“A”), while the 
conclusion deduces the consequent variable (“B”), an anticipated outcome. In 
this sense, deductive inferences look forward to something that will happen in 
the future. A classic illustration of deductive reasoning has this character: MAJOR 
PREMISE: All men are mortal. 

MINOR PREMISE: Socrates is a man. 

CONCLUSION: Therefore, Socrates is a mortal (i.e., he will die). 


In an abductive argument, the minor premise affirms the consequent variable 
(“B”) and its conclusion infers the antecedent variable (“A”)—the variable 
referring to something that went before, either logically or temporally. Abductive 
reasoning, thus, often affirms a past occurrence. For this reason, forensic or 
historical scientists such as geologists, paleontologists, archaeologists, and 
evolutionary biologists often use abductive reasoning to infer past conditions or 
causes from present clues. As Stephen Jay Gould notes, historical scientists 
typically “infer history from its results.” ! 

For example, a geologist might reason as follows: MAJOR PREMISE: If a 
mudslide occurred, we would expect to find felled trees. 

MINOR PREMISE: We find evidence of felled trees. 

CONCLUSION: Therefore, we have reason to think that a mudslide may have 

occurred. 


In the deductive form, if the premises are true, the conclusion follows with 
certainty. The logic of the abductive arguments is different, however. Abductive 
arguments do not produce certainty, but instead merely plausibility or possibility. 
To see why, consider the following variation of the preceding abductive 
argument: MAJOR PREMISE: If a mudslide occurred, we would expect to find felled 
trees. 

MINOR PREMISE: We find felled trees. 

CONCLUSION: Therefore, a mudslide occurred. 


or symbolically: MAJOR PREMISE: If MS, then FT. 
MINOR PREMISE: F'T. 
CONCLUSION: Therefore, MS. 


Notice that unlike the first version of the abductive argument in which the 
conclusion was stated tentatively (“We have reason to think that a mudslide may 
have occurred”), in this version the conclusion is affirmed definitively (“A 
mudslide occurred”). Obviously, this latter form of argument has a problem. It 
does not follow that, because the trees have fallen, a mudslide necessarily 
occurred. The trees may have fallen for some other reason. A hurricane may 
have blown them down; perhaps an ice storm occurred and the trees fell under 
the weight of accumulating ice; or loggers may have cut them down. In logic, 
affirming the consequent variable of a minor premise (with certainty) constitutes 
a formal fallacy—a fallacy that derives from the failure to acknowledge that 


more than one cause (or antecedent) might produce the same evidence (or 
consequent). 

Even so, the presence of downed timber might indicate that a mudslide has 
occurred. Thus, amending the above argument to conclude: “We have reason to 
think that a mudslide may have occurred” does not commit a fallacy. Even if we 
may not affirm the consequent with certainty, we may affirm it as a possibility. 
This is precisely what abductive reasoning does. It provides a reason for 
considering that a hypothesis—and often a hypothesis about the past—might be 
true, even if one cannot affirm the hypothesis (or conclusion) with certainty. 


THE METHOD OF MULTIPLE COMPETING HYPOTHESES TO 
ADDRESS THIS LIMITATION IN ABDUCTIVE REASONING AND TO 
MAKE IT POSSIBLE TO STRENGTHEN INFERENCES ABOUT THE PAST, 
THE NINETEENTH-CENTURY GEOLOGIST THOMAS CHAMBERLAIN 
DEVELOPED A FORM OF REASONING HE CALLED “THE METHOD OF 
MULTIPLE WORKING HYPOTHESES.” 4 HISTORICAL AND FORENSIC 
SCIENTISTS EMPLOY THIS METHOD WHEN MORE THAN ONE CAUSE 
OR HYPOTHESIS CAN ACCOUNT FOR THE SAME EVIDENCE. THEY 
USE IT TO ADJUDICATE BETWEEN COMPETING HYPOTHESES BY 
COMPARING THEM TO SEE WHICH ONE BEST EXPLAINS NOT JUST 
ONE PIECE OF EVIDENCE BUT, USUALLY, A WIDER CLASS OF 
RELEVANT FACTS. 


For example, consider how this method of reasoning was used to establish the 
hypothesis of continental drift as the best explanation for a wide range of 
geological observations. During the early 1900s, a German geologist and 
meteorologist named Alfred Wegener became fascinated with the way the 
African and South American continents fit together on the map like pieces of a 
jigsaw puzzle.!° He proposed that the continents had once been fused together as 
a single giant continent that he called “Pangea,” which later separated and 
drifted apart. 

Initially, many geologists ridiculed Wegener’s idea. They thought that—given 
the vast distances separating the continents—the matching shapes were most 
likely just a coincidence. Wegener’s critics dismissed his theory of continental 
drift as “delirious ravings,” “Germanic pseudo-science,” or a “fairy tale.” !° But 
Wegener cited other evidence that he thought continental drift could explain that 
the coincidence hypothesis could not. He noted that fossil forms discovered on 


the east coast of South America matched those on the west coast of Africa in 
corresponding places and sedimentary strata. This fact seemed too coincidental 
to him to be explained away by chance alone. Nevertheless, other geologists 
attempted to explain matching fossil forms an ocean apart not as the result of the 
movement of the continents, but instead as the result of the migration of flora 
and fauna—either across oceans or over ancient land bridges.!’ This introduced 
a third hypothesis into the mix, one that, in conjunction with the coincidence 
hypothesis, could explain each of the same facts that Wegener’s 
hypothesis could. 

Later, however, an additional set of facts came to light—one that helped 
scientists to decide between the competing hypotheses. During World War II, the 
United States Navy surveyed the seafloor topography and measured the earth’s 
magnetic field across the oceans. These magnetic surveys showed parallel stripes 
of magnetized rock, each with the same polarity on either side of mountain 
ridges running down the middle of the ocean floor at equal distances from the 
mid-oceanic mountain ranges.!® Geologists also learned that magma was 
continually seeping out at the middle of these mid-oceanic mountain ranges. 
They discovered that as the magma cools, it “acquires” a characteristic magnetic 
signature reflecting the polarity of the earth’s magnetic field at that location at 
the time of its cooling. When ships towing sensitive magnetometers measured 
this “remanent magnetization,” scientists learned that the magnetization of the 
seafloor alternated between sections of “normal” and “reverse” polarity as the 
magnetometer was towed away from a mid-ocean ridge in each direction. This 
led to the discovery of a famous symmetrical “piano key” pattern on each side of 
a mid-ocean ridge, seen in Figure 17.2. 

To explain this symmetrical pattern of alternating magnetism, geologists 
proposed that the magnetic stripes were formed as the result of the seafloor 
spreading away from the mid-ocean ridge as magma was extruded and cooled in 
the presence of earth’s changing magnetic field—in other words, that the 
continents were literally drifting apart. This hypothesis not only explained the 
symmetrical pattern of magnetic stripes but also other relevant evidence. 
Although the other hypotheses could explain (or explain away) the fit of the 
continents and/or the similar pattern of fossilization across the oceans, only 
continental drift (driven by seafloor spreading) could explain the magnetic 
seafloor stripes and these other pieces of evidence. Consequently, as the result of 
its superior explanatory power, a decisive case for continental drift was soon 
established, strengthening a merely plausible abductive inference about the past 


movement of the continents by showing that this inference provided the best 
(and only adequate) explanation of all the relevant facts.'° 
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FIGURE 17.2 

This diagram shows the symmetrical pattern of alternating magnetic stripes of 
either “normal” or “reversed” polarity on either side of a mid-ocean ridge. 
Because this “piano key” pattern could only be explained by plate tectonics and 
seafloor spreading, it contributed to the widespread acceptance of those theories 
within contemporary geology. 


Contemporary philosophers of science such as Peter Lipton have called this 
method of reasoning “inference to the best explanation.”*° Scientists often use 
this method when trying to explain the origin of an event or structure from the 
past. They compare various hypotheses to see which would, if true, best explain 
it.*! They then provisionally affirm the hypothesis that best explains the data as 
the one that is most likely to be true. 

Obviously, saying, “The best explanation is the one that best explains the 
facts or that best explains the most facts,” begs an important question. What does 
it mean to explain something well or best? 

As it happens, historical scientists have developed criteria for deciding which 
explanation, among a group of competing possible hypotheses, provides the best 
explanation for some event in the remote past. The most important of these 
criteria is “causal adequacy.” As a condition of formulating a successful 
explanation, historical scientists must identify causes that are known to have the 
power to produce the kind of effect, feature, or event in question. In seeking to 
identify such causes, historical scientists evaluate hypotheses in light of their 
present knowledge of cause and effect. Causes that are known to produce the 


effect in question (or are thought capable of doing so) are judged to be better 
candidates than those that are not. For instance, a volcanic eruption provides a 
better explanation for an ash layer in the earth than an earthquake or a flood, 
because eruptions have been observed to produce ash layers, whereas 
earthquakes and floods have not. 

One of the first historical scientists to develop the criterion of causal 
adequacy was the geologist Charles Lyell (1797-1875), who in turn influenced 
Charles Darwin. Darwin read Lyell’s magnum opus, The Principles of Geology, 
during his voyage on the HMS Beagle and employed Lyell’s principles of 
reasoning in On the Origin of Species. The subtitle of Lyell’s book summarized 
his central methodological principle: Being an Attempt to Explain the Former 
Changes of the Earth’s Surface, by Reference to Causes Now in Operation 
(1830-1833). Lyell argued that when scientists seek to explain events in the past, 
they should not invoke some unknown type of cause, the effects of which we 
have not observed. Instead, they should cite causes that are known from our 
uniform experience to have the power to produce the effect in question.?+ 
Historical scientists should cite presently acting causes, that is, “causes now in 
operation.” This was the idea behind his uniformitarian method and its famous 
dictum: “The present is the key to the past.” According to Lyell, our present 
experience of cause and effect should guide our reasoning about the causes of 
past events. Darwin adopted this methodological principle as he sought to 
demonstrate that natural selection qualified as a vera causa, that is, a true, 
known, or actual cause of significant biological change.*° In other words, he 
sought to show that natural selection was “causally adequate” to produce the 
effects he was trying to explain. 


THE ONLY KNOWN CAUSE BOTH PHILOSOPHERS OF SCIENCE AND 
LEADING HISTORICAL SCIENTISTS HAVE EMPHASIZED CAUSAL 
ADEQUACY AS THE KEY CRITERION BY WHICH COMPETING 
HYPOTHESES ARE JUDGED. BUT PHILOSOPHERS OF SCIENCE HAVE 
INSISTED THAT ASSESSMENTS OF EXPLANATORY POWER LEAD TO 
CONCLUSIVE INFERENCES ONLY WHEN THERE IS JUST ONE KNOWN 
CAUSE FOR THE EFFECT OR EVIDENCE” (SEE FIG. 17.3) IN 
QUESTION. IF THERE ARE MANY CAUSES THAT CAN PRODUCE THE 
SAME EFFECT, THEN THE PRESENCE OF THE EFFECT DOES NOT 
DEFINITIVELY ESTABLISH THE CAUSE. WHEN SCIENTISTS KNOW OF 
ONLY ONE CAUSE FOR A GIVEN EFFECT, HOWEVER, THEY CAN 


INFER THAT CAUSE AND YET AVOID THE FALLACY OF AFFIRMING 
THE CONSEQUENT—THE ERROR OF IGNORING OTHER POSSIBLE 
CAUSES WITH THE POWER TO PRODUCE THE SAME EFFECT.*° IN 

THAT CASE, THEY CAN INFER OR DETECT A UNIQUELY PLAUSIBLE 

PAST CAUSE FROM THE CLUES THAT ARE LEFT BEHIND. 


This can happen in one of two ways. First, historical scientists might focus 
their investigation on a single fact (in isolation) for which only one cause 
happens to be known. In such a case, they can quickly and decisively infer the 
cause from the effect alone—without risk of affirming the consequent, because 
no other known cause produces the same effect. For example, because a volcanic 
eruption is the only known cause of a volcanic ash layer, the presence of such a 
layer at an archeological site strongly indicates the prior eruption of a volcano. 








FIGURE 17.3 

Schematic of the logical problem of retrodiction. Whether it is possible to 
reconstruct the past definitively or not depends upon whether there is a single 
cause or condition that gives rise to a present state or whether there are many 
possible past causes or conditions that give rise to a given present state. The 
diagram on the left portrays an information-destroying situation in which many 
past causes (or conditions) correspond to a given present state. The diagram on 
the right portrays an information-preserving situation in which only one past 
cause (or condition) corresponds to a present state. Adapted from Sober, 
Reconstructing the Past, 4. 


In other cases where historical scientists encounter evidence for which there 
are many known causes, they will often broaden their investigation beyond an 
initial fact or set of facts. In such cases, they will use the strategy described 
above (as part of the method of multiple competing hypotheses), by looking for 


additional evidence until they find a piece for which there is only one known 
cause. They can then compare the explanatory power of the competing 
hypotheses. Using this strategy, historical scientists will choose the proposed 
cause with the demonstrated power to produce all the relevant evidence, 
including the new fact or piece of evidence for which there is only one known 
cause. For example, the discovery of the symmetrical pattern of ocean-floor 
magnetism on opposite sides of a mid-oceanic ridge allowed for a comparison of 
the explanatory power of the three hypotheses under consideration, leaving only 
seafloor spreading as a causally adequate explanation of all the relevant facts. 
Such an approach often allows historical scientists to pick out a piece of 
evidence (from some combination of effects) for which there is only one known 
(or theoretically plausible) cause, thus making it possible to establish a past 
cause decisively. Though this strategy involves looking at a wider class of facts 
than the first strategy, the logical status of the inferences involved is the same. In 
each case, the presence of a fact (either standing on its own or in combination 
with other facts) for which only one cause is known allows historical scientists 
to make a definitive inference about the causal history in question without 
committing the fallacy of affirming the consequent. Logically, if a postulated 
cause is known to be a necessary condition or cause of a given event or effect, 
then historical scientists can validly infer that condition or cause from the 
presence of the effect. If it’s true that where there is smoke there is always first 
fire, then the presence of smoke wafting up over a distant mountain range 
decisively indicates the prior presence of a fire on the other side of the ridge. 


HISTORICAL INFERENCE AND INTELLIGENT DESIGN WHAT DOES 
ALL THIS HAVE TO DO WITH THE CAMBRIAN EXPLOSION? 


Quite a lot. In my investigation of the historical scientific method, I found 
that whether they always realize it or not, historical scientists typically use the 
method of inference to the best explanation. They make abductive inferences 
about past causes from present clues, evidence, or effects. This later suggested to 
me that if there were features of the Cambrian explosion or the Cambrian 
animals that would be “expected as a matter of course” if an intelligent designer 
had played a role in that event, then it was at least possible to formulate the 
hypothesis of intelligent design as a historical (abductive) scientific inference. 
An advocate of intelligent design could reason in a standard historical scientific 
way: MAJOR PREMISE: If intelligent design played a role in the Cambrian 


explosion, then feature (X) known to be produced by intelligent activity would 
be expected as a matter of course. 
MINOR PREMISE: Feature (X) is observed in the Cambrian explosion of animal 
life. 
CONCLUSION: Hence, there is reason to suspect that an intelligent cause played 
a role in the Cambrian explosion. 


Of course, a historical scientist would only be justified in making such an 
abductive inference to the past activity of an intelligent cause if “feature X” is 
evident in the Cambrian explosion and if intelligent design is known to produce 
“feature X.” Moreover, just because the Cambrian explosion may exhibit some 
feature or features for which intelligent design is a known cause does not mean 
that intelligent design was necessarily the actual cause (or the best explanation) 
of those features. Only if the Cambrian event and animals exhibit features for 
which intelligent design is the only known cause may a historical scientist make 
a decisive inference to a past intelligent cause. 

We are left with two crucial questions. Are there in fact such features present 
in the record of the Cambrian explosion or in the animals that arise in it— 
features that are known from our experience to be produced by intelligent causes 
such that they would justify making a tentative abductive inference to intelligent 
design? Are there also perhaps features of the Cambrian event that are known 
from our experience to be produced by intelligent causes, and only intelligent 
causes, justifying a more definitive inference to past intelligent activity as the 
best explanation for the relevant evidence? Might “the butler” have done it 
after all? 


18 


Signs of Design in the Cambrian Explosion 


Well-crafted mystery novels, like real-world crime investigations, unfold with a 
distinctive logic. There is a death to be explained and, at the start, an indefinitely 
large universe of possible causes. That universe can be made smaller, narrowing 
to the one true cause, as more and more clues come to light. Those clues 
typically come in two forms: positive evidence, or indicators of what likely 
happened (e.g., .38 caliber shell casings on the ground and bullet wounds in a 
body) and negative evidence, or indicators of what could not have happened. 

Let’s say that the local sheriff who discovered the body of the estate owner 
(from my illustration in the previous chapter), did so as he was making his 
rounds on a dirt road that makes a close approach to the beach at the end of the 
estate where the owner died. And let’s say that, as a result, the sheriff just 
happened to find the body soon after the murder had taken place. Let’s suppose, 
further, that the sheriff had the presence of mind to immediately measure the 
temperature of the body only to find that the victim was still warm, indeed 
almost as warm as a living person. Clearly, in this situation, the sheriff would 
conclude that the victim had just died. At that point in the investigation, a 
physical regularity would govern the sheriff’s reasoning, one that tells a lot about 
who didn’t commit the murder. Following death, the human body cools to the 
surrounding temperature at a known rate. So, making allowances for vehicular 
transport, whoever committed the murder could not have gone beyond a certain 
distance from the remote estate at the time the body was found. 

These facts would immediately provide a rock-solid alibi to the vast majority 
of humanity, anyone located safely outside that radius when the body was 
discovered. Of course, calling this information a negative clue is really only a 
convention of naming. “Negative” and “positive” refer to how we conceive of 
the implications of a fact, but not to the fact itself: the evidence, after all, is what 
it is. Even so, facts both exclude and allow competing possible hypotheses. As 
they accumulate, they typically paint a picture, a profile, of the actual cause of 


the event in need of explanation. Thus, when we say “the body temperature of 
the deceased rules out the 7 billion people who were well beyond the radius set 
by the cooling rate,” we could equally well have said, “the body temperature 
implicates some person within 30 miles of the estate when the sheriff arrived,” a 
population of possible suspects much smaller than when we started. 

As I have described the many attempts to explain the scientific enigma 
motivating this book, that mystery has, in one sense, progressively deepened. As 
more and more attempts to explain the Cambrian explosion of animal life have 
failed, the evidence that these various competing theories fail to explain may be 
considered a set of negative clues—evidence that effectively precludes certain 
possible causes or explanations. I’ve already explained why the received version 
of evolutionary theory, neo-Darwinism, fails to account for the explosion of 
information and form in the Cambrian period. I’ve also examined more recent 
evolutionary theories and shown why they too fail to explain key aspects of the 
evidence. To this point, then, much of the evidence has returned a negative 
verdict. It has told us a lot about what, in all probability, did not cause the 
Cambrian explosion. But, as in our hypothetical murder case, an accumulating 
body of evidence that makes one set of explanations less and less plausible may 
also begin to paint a picture of an alternative cause and the true explanation. 


PROFILE OF THE SUSPECT 


Long before detectives know the actual identity of a suspect, they will often 
compose a profile of the person they are seeking. One leading paleontologist has 
used this strategy to begin to draw a bead on the cause responsible for the 
Cambrian explosion. Douglas Erwin has dedicated his career to solving the 
problem of the origin of animal body plans (see Fig. 18.1). Trained at the 
University of California by James Valentine, another Cambrian expert, Erwin 
has worked closely over the past decade with Eric Davidson, whom we first met 
in Chapter 13, trying to determine what happened to cause dozens of novel body 
plans to appear—and appear rapidly—in the Cambrian period. 

Both Erwin and Davidson have now ruled out standard neo-Darwinian theory 
—vehemently in Davidson’s case. He says that the standard theory “gives rise to 
lethal errors.”! But Erwin and Davidson go further. They have assembled what 
is, in essence, a clue sheet—a list of key evidences that must be explained. Using 
that list, they have begun to sketch, at least in outline, a profile of the cause 
behind the Cambrian explosion. 

On the positive side of the ledger, they conclude that this cause must have 
several attributes in order to explain key facts about the fossil record as well as 
what it takes to build animals. In particular, the cause must be capable of 
generating a top-down pattern of appearance; it must be capable of generating 
new biological form relatively rapidly; and it must be capable of constructing, 
not merely modifying, complex integrated genetic circuits (specifically, the 
developmental gene regulatory networks discussed in Chapter 13). 

On the negative side, Davidson and Erwin rule out both observed 
microevolutionary processes and postulated macroevolutionary mechanisms 
(such as punctuated equilibrium and species selection) as explanations for the 
origin of the key features of the Cambrian explosion. They insist that the 
requirements for constructing animal body plans de novo “cannot be 
accommodated by microevolutionary [or] macroevolutionary theory.” 





FIGURE 18.1 
Douglas Erwin. Courtesy UPHOTO/Cornell University. 


In Chapter 13, I discussed their reason for coming to this conclusion: 
developmental gene regulatory networks, once in place, cannot be perturbed (or 
mutated) without “catastrophic”? consequences to the developing animal. Thus, 
fundamentally new gene regulatory networks (dGRNs) cannot evolve gradually 
from preexisting dGRNs, if those evolutionary changes require perturbing the 
deepest nodes of the earlier dGRNs. Yet building new dGRNs capable of 
producing new animals requires precisely such fundamental alterations in 
preexisting dGRNs. But then how would new regulatory networks ever arise? 
Davidson and Erwin insist that no current theory of evolution explains the origin 
of these systems. Thus, they conclude that the cause of the Cambrian explosion 
is not described by any currently proposed theory of micro-or macroevolution. 

In saying this, Erwin emphasizes the uniqueness of the innovations that 
occurred in the Cambrian explosion. He explains: “Unlike later events, the most 
significant developmental events of the Cambrian radiation involved the 
proliferation of cell types, developmental hierarchies and epigenetic cascades.”* 
Consequently, he concludes, “The crucial difference between the developmental 
events of the Cambrian and subsequent events is that the former involved the 
establishment of these developmental patterns, not their modification.”° For this 
reason, Erwin denies that the central event of the Cambrian explosion—the 
origin of novel body plans—has any parallel to currently observed biological 
processes. Rather, he insists that the events of the past were fundamentally 
different—that profound asymmetries exist between evolution then, and 
evolution now.° Thus, he amplifies his denial of the sufficiency of current 
evolutionary theory by adding one additional attribute, albeit a negative one, to 


his portrait of “the suspect”: the cause responsible for generating the new animal 
forms, whatever it was, must have been unlike any observed biological process 
operating in actual living populations today. 


PROFILING A CAUSE 


All of this raises an obvious question. Could the negative clues that increasingly 
disconfirm materialistic evolutionary theories also be positive indicators of a 
different kind of cause—perhaps even an intelligent cause? 

By sketching the profile of the kind of cause needed to explain the origin of 
animal life, Davidson, Erwin, and many other evolutionary biologists may have, 
inadvertently, rendered the idea of intelligent design a bit less inconceivable. To 
see why, let’s quickly review Erwin and Davidson’s profile of the suspect. They 
have concluded that the cause of the origin of the new animal forms in the 
Cambrian explosion must be capable of: 


e generating new form rapidly 
e generating a top-down pattern of appearance 
e constructing, not merely modifying, complex integrated circuits 


They have also concluded that this cause is: 


e not described by any currently proposed theory of micro-or 
macroevolution 

e unlike any observed biological process operating in actual living 
populations today 


Erwin and Davidson, no friends of intelligent design, have sketched a partial 
profile of an adequate cause as befits their particular interest in the importance of 
gene regulatory networks (Davidson) and fossil discontinuity (Erwin). But other 
evolutionary biologists have contributed to this picture as well. Simon Conway 
Morris marvels at “the uncanny ability of evolution to navigate to the 
appropriate solution through immense ‘hyperspaces’ of biological possibility.”” 
As a result, he argues that evolution may in some way be “channeled” toward 
propitious functional and/or structural end points—without specifying any 
known evolutionary mechanism that can so direct evolution to such end points.® 
James Shapiro proposes a mechanism of evolutionary change that relies on 
preprogrammed adaptive capacity—without explaining where — such 
preprogramming comes from. Several of the new evolutionary theories 
discussed in the previous chapters presuppose, but do not explain, the existence 


of both genetic and epigenetic forms of information, highlighting the need for a 
cause capable of generating such information in the first place. 

Erwin and Davidson have made a bold start with their clue list ruling out neo- 
Darwinism. But the evidence explored in these pages suggests additional 
attributes that need to be added to their profile of the actual cause of the 
Cambrian explosion. Our previous investigations have suggested that building an 
animal requires specified or functional information and that any explanation for 
the origin of the Cambrian animals must identify a cause capable of generating: 


e digital information 

e structural (epigenetic) information 

e functionally integrated and _ hierarchically organized layers of 
information 


Still, do any or even all of these clues add up to a reason for considering that 
an alternative kind of cause—a designing intelligence—might have played a role 
in the origin of animal life? 

They do. As it turns out, each of the features of the Cambrian animals and the 
Cambrian fossil record that constitute negative clues—clues that render neo- 
Darwinism and other materialistic theories inadequate as causal explanations— 
also happen to be features of systems known from experience to have arisen as 
the result of intelligent activity. In other words, standard materialistic 
evolutionary theories have failed to identify an adequate mechanism or cause for 
precisely those attributes of living forms that we know from experience only 
intelligence—conscious rational activity—is capable of producing. That 
suggests, in accord with the method of historical scientific reasoning elucidated 
in the previous chapter, the possibility of making a strong historical inference to 
intelligent design as the best explanation for the origin of those attributes. 

Let’s have a look at each of these features of the Cambrian event, starting 
with key features of the Cambrian animals themselves, to see how they might 
point to the past activity of a designing intelligence, thereby making intelligent 
design scientifically detectable. 


THE CAMBRIAN INFORMATION EXPLOSION 


We have seen that building a Cambrian (or any other) animal would require vast 
new, functionally specified digital information. Moreover, the presence of such 
digitally encoded information in DNA presents, at least, a striking appearance of 
design in all living organisms. As Richard Dawkins observes, for example, “The 
machine code of the genes is uncannily computer-like.”!° Similarly, 
biotechnology pioneer Leroy Hood refers to the information stored in DNA as 
“digital code” and describes it in terms reminiscent of computer software.'! And 
as we have seen, Microsoft’s Bill Gates notes: “DNA is like a computer program 
but far, far more advanced than any software ever created.” !2 

Yet we’ve also seen that neither neo-Darwinism nor any other materialistic 
evolutionary model or mechanism explains the origin of the genetic information 
(the digital code) necessary to produce the Cambrian animals or even the 
simplest structural innovations that they exhibit. Could this—from a materialistic 
point of view—unexplained appearance of design point instead to actual 
intelligent design? 

I think it does. But to explain why, I need to tell a bit more about the 
“evolution” of my own thinking on the matter. 

After learning about how historical scientists make inferences about the 
causes of events in the remote past, I first applied these methods of reasoning to 
the question of the origin of the information necessary to produce the first living 
cell. My book Signature in the Cell used the method of multiple competing 
hypotheses (or inference to the best explanation) to evaluate the “causal 
adequacy” of proposed explanations for the ultimate origin of biological 
information. I showed that chemical evolutionary models (whether based upon 
chance, physical-chemical necessity, or the combination of the two) failed to 
identify a cause capable of producing the digital information in DNA and RNA. 
Yet we do know of a cause that has demonstrated the causal power to produce 
digital code. That cause is intelligent agency. Since intelligent agency is the only 
cause known to be capable of generating information (at least starting from 
nonliving chemicals), intelligent design offers the best explanation for the origin 
of the information necessary to produce the first organism. 

The case for intelligent design in Signature was carefully limited as a 
challenge to chemical evolution. Many evolutionary biologists acknowledge that 
chemical evolutionary theory has failed to account for the origin of the first life. 


Many cite its inability to account for the origin of biological information as one 
of the main reasons for that failure. Moreover, because they do not think that 
natural selection could have played a significant role in evolution until after the 
first self-replicating organisms had arisen, most evolutionary biologists also 
think that explaining the origin of information in a prebiotic context is much 
more difficult than explaining the origin of new information in already living 
organisms. 

For this reason, in Signature I did not try to argue that intelligent design 
might help explain the origin of the information necessary to account for the 
origin of new animals from simpler preexisting forms of life. That would have 
required a separate demonstration showing the inadequacy of natural selection 
and mutation as a mechanism for generating new genetic information in already 
living organisms. This book—in Chapters 9-14—has_ provided that 
demonstration. These chapters show how neo-Darwinism fails to explain the 
origin of genetic information—at least, in amounts necessary to build a new 
protein fold. Chapters 15 and 16 showed, in addition, that the other main 
materialistic evolutionary theories also fail to account for the information 
necessary to build new forms of animal life. These theories presuppose, rather 
than explain, the origin of the information necessary for structural innovation in 
the history of life. And since the Cambrian explosion of animal life is an 
explosion of information and structural innovation, that raises a question. Is it 
possible that this increase of biological information not only represents evidence 
against materialistic theories of biological evolution, but also positive evidence 
for intelligent design? 


A Cause Now in Operation 


It does. Intelligent agents, due to their rationality and consciousness, have 
demonstrated the power to produce specified or functional information in the 
form of linear sequence-specific arrangements of characters. Digital and 
alphabetic forms of information routinely arise from intelligent agents. A 
computer user who traces the information on a screen back to its source 
invariably comes to a mind—a software engineer or programmer. The 
information in a book or inscription ultimately derives from a writer or scribe. 
Our experience-based knowledge of information flow confirms that systems 
with large amounts of specified or functional information invariably originate 
from an intelligent source. The generation of functional information is 
“habitually associated with conscious activity.”!° Our uniform experience 
confirms this obvious truth. 

It also suggests, therefore, that intelligent design meets the key “causal 
adequacy” requirement of a good historical scientific explanation. Certainly, 
intelligence is a “cause now in operation” capable of generating functional or 
specified information in a digital form. As I write this, my mind is generating 
specified information. Intelligent agents generate information in the form of 
software code, ancient inscriptions, books, encrypted military codes, and much 
else. And since we know of no “presently acting” materialistic cause that also 
generates large amounts’ of specified information (especially in a digital or 
alphabetic form), only intelligent design meets the causal adequacy requirement 
of a historical scientific explanation. In other words, our uniform experience of 
cause and effect shows that intelligent design is the only known cause of the 
origin of large amounts of functionally specified digital information. It follows 
that the great infusion of such information in the Cambrian explosion points 
decisively to an intelligent cause. 

Intelligent design stands alone as an explanation for the origin of genetic 
information for another reason: purposive agents have just those necessary 
powers that natural selection lacks as a condition of its causal adequacy. We have 
seen that natural selection lacks the ability to generate novel information 
precisely because it can only act after new functional information has arisen. 
Natural selection can favor new proteins and genes, but only after they perform 
some function (influencing reproductive output). The job of generating new 
functional genes, proteins, and systems of proteins therefore falls entirely to 


random mutations. Yet without functional criteria to guide a search through the 
space of possible sequences, random variation is probabilistically doomed. What 
is needed is not just a source of variation (i.e., the freedom to search a space of 
possibilities) or a mode of selection that can operate after the fact of a successful 
search, but instead a means of selection that (a) operates during a search—before 
success—and that (b) is guided by information about or knowledge of a 
functional target. 

Demonstration of this requirement has come from an unlikely quarter: genetic 
algorithms. Genetic algorithms are programs that allegedly simulate the creative 
power of mutation and selection. Richard Dawkins, Bernd-Olaf Kiippers, and 
others have developed computer programs that putatively simulate the 
production of genetic information by mutation and natural selection.!° Yet these 
programs succeed only by the illicit expedient of providing the computer with a 
“target sequence” and then treating proximity to future function (i.e., the target 
sequence), not actual present function, as a selection criterion. As mathematician 
David Berlinski shows, genetic algorithms need something akin to a “forward- 
looking memory” in order to succeed.'© Yet such foresighted selection has no 
analogue in nature. In biology, where differential survival depends upon 
maintaining function, natural selection cannot occur before new functional 
sequences arise. Natural selection lacks foresight; the process, as evolutionary 
theorists Rodin and Szathmary note, works strictly “ ‘in the present moment,’ 
right here and right now .. . lacking the foresight of potential future 
advantages.” !7 

What natural selection lacks, intelligent design—purposive, goal-directed 
selection—provides. Rational agents can arrange both matter and symbols with 
distant goals in mind. They also routinely solve problems of combinatorial 
inflation. In using language, the human mind routinely “finds” or generates 
highly improbable linguistic sequences to convey an intended or preconceived 
idea. In the process of thought, functional objectives precede and constrain the 
selection of words, sounds, and symbols to generate functional (and meaningful) 
sequences from a vast ensemble of meaningless alternative possible 
combinations of sound or symbol.!® Similarly, the construction of complex 
technological objects and products, such as bridges, circuit boards, engines, and 
software, results from the application of goal-directed constraints.!? Indeed, in 
all functionally integrated complex systems where the cause is known by 
experience or observation, designing engineers or other intelligent agents applied 
constraints on the possible arrangements of matter to limit possibilities in order 


to produce improbable forms, sequences, or structures. Rational agents have 
repeatedly demonstrated the capacity to constrain possible outcomes to actualize 
improbable but initially unrealized future functions. Repeated experience affirms 
that intelligent agents (minds) uniquely possess such causal powers. 

Analysis of the problem of the origin of biological information, therefore, 
exposes a deficiency in the causal powers of natural selection and other 
undirected evolutionary mechanisms that corresponds precisely to powers that 
agents are uniquely known to possess. Intelligent agents have foresight. Such 
agents can determine or select functional goals before they are physically 
instantiated. They can devise or select material means to accomplish those ends 
from among an array of possibilities. They can then actualize those goals in 
accord with a preconceived design plan or set of functional requirements. 
Rational agents can constrain combinatorial space with distant information-rich 
outcomes in mind. The causal powers that natural selection lacks—by definition 
—are associated with the attributes of consciousness and rationality—with 
purposive intelligence. Thus, by invoking intelligent design to overcome a vast 
combinatorial search problem and to explain the origin of new specified 
information, contemporary advocates of intelligent design are not positing an 
arbitrary explanatory element unmotivated by a consideration of the evidence. 
Instead, we posit an entity possessing precisely the causal powers that a key 
feature of the Cambrian explosion—the explosive increase in specified 
information—requires as a condition of its production and explanation. 


INTEGRATED CIRCUITRY: DEVELOPMENTAL GENE 
REGULATORY NETWORKS 


Keep in mind, too, that animal forms have more than just genetic information. 
They also need tightly integrated networks of genes, proteins, and other 
molecules to regulate their development—in other words, they require 
developmental gene regulatory networks, the dGRNs that Eric Davidson has so 
meticulously mapped over the course of his career. Developing animals face two 
main challenges. First, they must produce different types of proteins and cells 
and, second, they must get those proteins and cells to the right place at the right 
time.*? Davidson has shown that embryos accomplish this task by relying on 
networks of regulatory DNA-binding proteins (called transcription factors) and 
their physical targets. These physical targets are typically sections of DNA 
(genes) that produce other proteins or RNA molecules, which in turn regulate the 


expression of still other genes. 

These interdependent networks of genes and gene products present a striking 
appearance of design. Davidson’s graphical depictions of these dGRNs look for 
all the world like wiring diagrams in an electrical engineering blueprint or a 
schematic of an integrated circuit, an uncanny resemblance Davidson himself 
has often noted. “What emerges, from the analysis of animal dGRNs,” he muses, 
“is almost astounding: a network of logic interactions programmed into the DNA 
sequence that amounts essentially to a hardwired biological computational 
device.”*! These molecules collectively form a tightly integrated network of 
signaling molecules that function as an integrated circuit. Integrated circuits in 
electronics are systems of individually functional components such as 
transistors, resistors, and capacitors that are connected together to perform an 
overarching function. Likewise, the functional components of dGRNs—the 
DNA-binding proteins, their DNA target sequences, and the other molecules that 
the binding proteins and target molecules produce and regulate—also form an 
integrated circuit, one that contributes to accomplishing the overall function of 
producing an adult animal form. 

Yet, as explained in Chapter 13, Davidson himself has made clear that the 
tight functional constraints under which these systems of molecules (the dGRNs) 
operate preclude their gradual alteration by the mutation and selection 
mechanism. For this reason, neo-Darwinism has failed to explain the origin of 
these systems of molecules and their functional integration. Like advocates of 
evolutionary developmental biology, Davidson himself favors a model of 
evolutionary change that envisions mutations generating large-scale 
developmental effects, thus perhaps bypassing nonfunctional intermediate 
circuits or systems. Nevertheless, neither proponents of “evo-devo,” nor 
proponents of other recently proposed materialistic theories of evolution, have 
identified a mutational mechanism capable of generating a dGRN or anything 
even remotely resembling a complex integrated circuit. Yet, in our experience, 
complex integrated circuits—and the functional integration of parts in complex 
systems generally—are known to be produced by intelligent agents— 
specifically, by engineers. Moreover, intelligence is the only known cause of 
such effects. Since developing animals employ a form of integrated circuitry, and 
certainly one manifesting a tightly and functionally integrated system of parts 
and subsystems, and since intelligence is the only known cause of these features, 
the necessary presence of these features in developing Cambrian animals would 
seem to indicate that intelligent agency played a role in their origin (see Fig. 


13.4). 


THE HIERARCHICAL ORGANIZATION OF GENETIC AND 
EPIGENETIC INFORMATION 


In addition to the information stored in individual genes and the information 
present in the integrated networks of genes and proteins in dGRNs, animal forms 
exemplify hierarchical arrangements or layers of information-rich molecules, 
systems, and structures. For example, developing embryos require epigenetic 
information in the form of specifically arranged (a) membrane targets and 
patterns, (b) cytoskeletal arrays, (c) ion channels, and (d) sugar molecules on the 
exterior of cells (the sugar code). As noted in Chapter 13, much of this 
epigenetic information resides in the structure of the maternal egg and is 
inherited directly from membrane to membrane independently of DNA. 

This three-dimensional structural information interacts with other 
information-rich molecules and systems of molecules to ensure the proper 
development of an animal. In particular, epigenetic information influences the 
proper positioning and thus the function of regulatory proteins (including DNA- 
binding proteins), messenger RNAs, and various membrane components. 
Epigenetic information also influences the function of developmental gene 
regulatory networks. Thus, information at a higher structural level in the 
maternal egg helps to determine the function of both whole networks of genes 
and proteins (dGRNs) and individual molecules (gene products) at a lower level 
within a developing animal. Genetic information is necessary to specify the 
arrangement of amino acids in a protein or bases in an RNA molecule. Similarly, 
dGRNs are necessary to specify the location and/or function of many gene 
products. And, in a similar way, epigenetic information is necessary to specify 
the location and determine the function of lower-level molecules and systems of 
molecules, including the dGRNs themselves. 

Furthermore, the role of epigenetic information provides just one of many 
examples of the hierarchical arrangement (or layering) of information-rich 
structures, systems, and molecules within animals. Indeed, at every level of the 
biological hierarchy, organisms require specified and highly improbable 
(information-rich) arrangements of lower-level constituents in order to maintain 
their form and function. Genes require specified arrangements of nucleotide 
bases; proteins require specified arrangements of amino acids; cell structures and 
cell types require specified arrangements of proteins or systems of proteins; 


tissues and organs require specific arrangements of specific cell types; and body 
plans require specialized arrangements of tissues and organs. Animal forms 
contain information-rich lower-level components (such as proteins and genes). 
But they also contain information-rich arrangements of those components (such 
as the arrangement of genes and gene products in dGRNs or proteins in 
cytoskeletal arrays or membrane targets). Finally, animals also exhibit 
information-rich arrangements of higher-level systems and structures (such as 
the arrangements of specific cell types, tissues, and organs that form specific 
body plans). 

The highly specified, tightly integrated, hierarchical arrangements of 
molecular components and systems within animal body plans also suggest 
intelligent design. This is, again, because of our experience with the features and 
systems that intelligent agents—and only intelligent agents—produce. Indeed, 
based on our experience, we know that intelligent human agents have the 
capacity to generate complex and functionally specified arrangements of matter 
—that is, to generate specified complexity or specified information. Further, 
human agents often design information-rich hierarchies, in which both individual 
modules and the arrangement of those modules exhibit complexity and 
specificity—specified information as defined in Chapter 8. Individual transistors, 
resistors, and capacitors in an integrated circuit exhibit considerable complexity 
and specificity of design. Yet at a higher level of organization, the specific 
arrangement and connection of these components within an integrated circuit 
requires additional information and reflects further design (see Fig. 14.2). 

Conscious and rational agents have, as part of their powers of purposive 
intelligence, the capacity to design information-rich parts and to organize those 
parts into functional information-rich systems and hierarchies. We know of no 
other causal entity or process that has this capacity. Clearly, we have good reason 
to doubt that mutation and selection, self-organizational processes, or any of the 
other undirected processes cited by other materialistic evolutionary theories, can 
do so. Thus, based upon our present experience of the causal powers of various 
entities and a careful assessment of the efficacy of various evolutionary 
mechanisms, we can infer intelligent design as the best explanation for the origin 
of the hierarchically organized layers of information needed to build the animal 
forms that arose in the Cambrian period. 


LOCATION, LOCATION, LOCATION 


There is another remarkable aspect of the hierarchical organization of 
information in animal forms. Many of the same genes and proteins play very 
different roles, depending upon the larger organismal and informational context 
in which they find themselves in different animal groups.** For example, the 
same gene (Pax-6 or its homolog, called eyeless), helps to regulate the 
development of the eyes of fruit flies (arthropods) and those of squid and mice 
(cephalopods and vertebrates, respectively). Yet arthropod eyes exemplify a 
completely different structure from vertebrate or cephalopod eyes. The fruit fly 
possesses a compound eye with hundreds of separate lenses (ommatidia), 
whereas both mice and squid employ a camera-type eye with a single lens and 
retinal surface. In addition, although the eyes of squid and mice resemble each 
other optically (single lens, large internal chamber, single retinal surface), they 
focus differently. They undergo completely different patterns of development and 
utilize different internal structures and nerve connections to the visual centers of 
the brain. Yet the Pax-6 gene and its homologs play a key role in regulating the 
construction of all three of these different adult sensory structures. Moreover, 
evolutionary and developmental biologists have found that this pattern of “same 
genes, different anatomy” recurs throughout the bilaterian phyla, for features as 
fundamental as appendages, segmentation, the gut, heart, and sense organs (see 
Fig. 18.2).?° 

This pattern contradicts the expectations of textbook evolutionary theory. 
Neo-Darwinism predicts that disparate adult structures should be produced by 
different genes. This prediction follows directly from the neo-Darwinian 
assumption that all evolutionary (including anatomical) transformations begin 
with mutations in DNA sequences—mutations that are fixed in populations by 
natural selection, genetic drift, or other evolutionary processes. The arrow of 
causality flows one way from genes (DNA) to development to adult anatomy. 
Thus, if biologists observe different animal forms, it follows that they should 
expect that different genes will specify those forms during animal development. 
Given the profound differences between the fruit-fly compound eye and the 
vertebrate camera eye, neo-Darwinian theory would not predict that the “same” 
genes would be involved in building different eyes in arthropods and 
chordates.** 





A. MAMMAL _B. INSECT C. CEPHALOPOCD 





FIGURE 18.2 
The same genes can be used in different animals to produce dramatically 
different structures, contradicting neo-Darwinian expectations. 


Many leading evolutionary theorists have acknowledged this problem. 
University of Wisconsin evo-devo researcher Sean B. Carroll has noted that the 
neo-Darwinian prediction of similar genes producing similar structures is 
“entirely incorrect.”*? Stephen Jay Gould described the discovery of the 
polyfunctional role of similar genes as “explicitly unexpected” and 
“discombobulat[ing] the confident expectations of orthodox theory.”*° 

The theory of intelligent design suggests a solution to the problem—a 
solution familiar to us from the construction and operation of our own artifacts. 
Figure 18.3 shows a general-purpose switching transistor. These electronic 
components can be used to help build many electronic systems, from a computer 
to a microwave oven to a radio. And the exact functional role that the transistor 
will play will be governed by the system in which it finds itself. (One must, of 
course, make allowances for the particular specifications of the transistor itself; a 
transistor cannot function as a battery). 

Nowhere, however, is this feature of polyfunctional modularity more 
intuitively clear than in our use of natural languages, such as English. To 
illustrate this, my colleague Paul Nelson once “disassembled” the last forty-four 
words of Abraham Lincoln’s Gettysburg Address (see Fig. 18.4) into a lexicon. 
Using the same words, at roughly the same frequency, he then wrote an 
“Anarchist’s Manifesto,” with a meaning diametrically opposite to that of 
Lincoln’s. What had changed were not the words—that is, the lower-level 
modules. Rather, the higher-level context, or the system as a whole, differed. 
Lincoln wrote with one meaning in mind, the anarchist with another—and that 
made all the difference to the functions of the elements, or modules, within the 
respective systems. Nelson calls this context dependency “the organismal 


context principle” when it occurs in biology and he likens it to the context 
dependence of words in language or low-level parts in a technological system. 








FIGURE 18.3 

A general purpose transistor—an example of a component that can be used to 
perform different functions in different designed systems. Courtesy 
iStockphoto.com/S230. 
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FIGURE 18.4 

This comparison of Lincoln’s Gettysburg Address and an “Anarchist’s 
Manifesto” shows how the same modular elements (words) can perform 
different functions depending on their surrounding context, just as many genes 
do in biological systems. Courtesy Paul Nelson. 


He also argues that intelligent design provides a compelling explanation for 
the presence of polyfunctional modularity in living systems. Why? Not only is 
the polyfunctionality of genetic modules unexpected in a neo-Darwinian view, it 
is a common feature of intelligently designed systems. As Nelson and Jonathan 
Wells note, “An intelligent cause may reuse or redeploy the same module in 
different systems, without there necessarily being any material or physical 
connection between those systems.”*’ They also observe that intelligent agents 
“can generate identical patterns independently” and put them to different uses in 
different systems of parts: 


If we suppose that an intelligent designer constructed organisms using a 
common set of polyfunctional genetic modules—just as human designers, 
for instance, may employ the same transistor or capacitor in a car radio 
or a computer, .. . then we can explain why we find the “same” genes 
expressed in the development of what are very different organisms. ...A 
particular gene, employed for its DNA-binding properties, finds its 
functional role in a higher-level system whose ultimate origin was 
intelligently caused.78 


Wells and Nelson go on to explain that “the overall system, not the gene 
itself’ determines the ultimate functional significance of the lower level 
modules, just as it does in all human technological or communication systems. 
Certainly, both the computer software and computer hardware (integrated 
circuits) exhibit this feature—what might be called “context-dependent, 
polyfunctional modularity.” Similarly, in information-rich texts—such as the 
Gettysburg Address or the Anarchist’s Manifesto—humans convey different 
meanings with the same low-level modules (words) depending upon their 
surrounding context. Experience shows that when we know how systems 
possessing this feature arose, invariably they arose by intelligent design. 


FEATURES OF THE PRE-CAMBRIAN-CAMBRIAN FOSSIL RECORD 


Intelligent design not only helps to explain many key features of the Cambrian 
animals themselves; it also helps to explain many otherwise anomalous features 
of the Cambrian fossil record. 


An Inverted Cone: Disparity Preceding Diversity 


As discussed in Chapter 2, the fossil record shows a “top-down” pattern in which 
phyla-level morphological disparity appears first followed only later by species- 
level diversity. Major innovations in body plans precede minor variations on 
basic designs.*? This “inverted cone of diversity” also suggests intelligent 
design. 

Neo-Darwinism seeks to explain the origin of novel body plans by starting 
with simpler body plans and gradually assembling animals with more complex 
body plans via the gradual accumulation of small successive material variations. 
Thus, neo-Darwinism employs a “bottom-up” mode of causation. With a bottom- 


up approach, small-scale diversification should eventually produce large-scale 
morphological disparity—differences in body plan. The “bottom-up” metaphor 
thus describes a kind of self-assembly in which the gradual production of the 
material parts eventually generates the organization of the whole. This suggests 
in turn that the parts stand causally prior to the organization of the whole. As I 
have argued, however, this approach encounters both paleontological and 
biological difficulties: the fossil record leaves no evidence of such a process and 
the morphological innovation and transformations that it requires are, in any 
case, biologically implausible. 

But if a bottom-up approach fails, perhaps a “top-down” approach will 
succeed. “Top-down” causation begins with a basic architecture, blueprint, or 
plan and then proceeds to assemble parts in accord with it. The blueprint stands 
causally prior to the assembly and arrangement of the parts. But where could 
such a blueprint come from? One possibility involves a mental mode of 
causation. Intelligent agents often conceive of plans prior to their material 
instantiation—that is, the preconceived design of a blueprint often precedes the 
assembly of parts in accord with it. An observer touring the parts section of a 
General Motors plant will see no direct evidence of a prior blueprint for GM’s 
new models, but will perceive the basic design plan immediately upon observing 
the finished product at the end of the assembly line. Designed systems, whether 
automobiles, airplanes, or computers, invariably manifest a design plan that 
preceded their first material instantiation. But the parts do not generate the 
whole. Rather, an idea of the whole directed the assembly of the parts. 

This form of causation can certainly explain the pattern in the fossil record. 
As new species appear in the Cambrian, they manifest completely novel, 
morphologically disparate, and functionally integrated body plans. Thus, 
although the fossil record does not directly establish the existence of a prior 
mental plan or blueprint, such a plan could certainly explain, or be inferred from, 
the top-down pattern of fossil evidence. In other words, if the body plans of the 
Cambrian animals did arise as the result of a “top-down” mode of causation 
involving a preconceived design plan, we would expect, based on our experience 
of complex designed systems, to find precisely the pattern of evidence that we 
do see in the fossil record. Further, materialistic “bottom-up” models of 
causation fail to explain this same pattern of fossil evidence. Thus, intelligent 
design provides a better explanation of this feature of the Cambrian fossil record 
than do competing materialistic evolutionary theories. 

The design hypothesis can also explain why smaller-scale diversity arises 


after, not before, morphological disparity in the fossil record or, to put it more 
poetically, why the basic themes of life precede the variation on those themes. 
Complex designed systems have a fundamental functional integrity that makes 
their alteration difficult. For this reason, we should not expect gradual 
mechanisms of change to produce new body plans or alter them fundamentally 
after they have arisen. We might, however, expect to find variations on these 
basic themes within the functional limits established by a basic architecture or 
body plan. Fundamentally new forms of organization require design from 
scratch. For example, airplanes did not arise gradually or incrementally from 
automobiles. Nevertheless, new innovations often accrete to novel designs 
provided the fundamental organizational plan is not altered. 

Since the invention of the automobile, all cars have included the same basic 
structural and functional elements, including a motor, at least three (and usually 
four) wheels, a carriage with seats for passengers, a structure connecting the 
wheels to the carriage, a steering wheel and column (or analogous mechanism), 
and a means of translating energy generated by the motor to the wheels. These 
are minimal requirements, of course; many cars have used axles to connect the 
wheels, though some have not and a “stretch” limousine may need additional 
axles or wheels. Indeed, though many new variations on the original model have 
arisen after the invention of the basic automobile design, all exemplify this same 
basic design. Interestingly, we also observe this pattern in the fossil record. The 
major animal body plans appear first instantiated by a single (or very few) 
species or genera. Then, later, many other varieties arise with many new 
features, yet all still exhibit the same basic body plan. 

Experience shows a hierarchical relationship between functionally necessary 
and functionally optional features in designed systems. An automobile cannot 
function without a motor or steering mechanism; it can function with or without 
twin I-beam suspension, antilock brakes, or “stereo surround sound.” This 
distinction between functionally necessary and optional features suggests the 
possibility of future innovation and variation on basic design plans, even as it 
imposes limits on the extent to which the basic designs themselves can be 
altered. The logic of designed systems, therefore, suggests precisely the kind of 
top-down pattern that we see in both the history of our own technological 
innovation*? and in the history of life following the Cambrian explosion 
(compare Figs. 18.5a and 18.5b). On the other hand, competing materialistic 
evolutionary theories would not lead us to expect the fossil record to manifest 
such a “top-down” pattern, but the opposite. 


Sudden Appearance and Missing Ancestors 


The theory of intelligent design can also help to account for the abrupt 
appearance of complex anatomical structures and animal body plans in the fossil 
record. Intelligent agents sometimes produce material entities through a series of 
gradual modifications (as when a sculptor shapes a sculpture over time). 
Nevertheless, intelligent agents also have the capacity to introduce complex 
technological systems into the world fully formed. Often such systems bear no 
resemblance to earlier technological systems—their invention occurs without a 
material connection to earlier, more rudimentary technologies. When the radio 
was first invented, it was unlike anything that had come before, even other forms 
of communication technology. For this reason, although intelligent agents need 
not generate novel structures abruptly, they can do so. Thus, invoking the 
activity of a mind provides a causally adequate explanation for the pattern of 
abrupt appearance in the Cambrian fossil record. 
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FIGURE 18.5 

Figure 18.5a (top): A top-down pattern of appearance found in the history of 
animal life. Figure 18.5b (bottom): A top-down pattern of appearance in human 
technology. 


On the other hand, strictly materialistic theories of evolution necessarily 
envision a “bottom-up” mode of causation in which material parts or materially 
instantiated intermediate forms of organization necessarily precede the 
emergence of fully developed body plans. For this reason, the sudden appearance 
of novel animal forms contradicts the expectations of most materialistic 
evolutionary theories. Neo-Darwinism, in particular, would not expect the 
sudden appearance of animal forms. As Darwin himself insisted: “Natura non 
facit saltum” (“Nature takes no leaps”). Yet intelligent agents can act suddenly 
or discretely in accord with their powers of rational choice or volition, even if 
they do not always do so. Thus, the sudden appearance of the Cambrian animals 


does suggest, at least, the possibility of a volitional act of a conscious agent—a 
designer. 

Intelligent design likewise helps explain the absence of ancestral precursors. 
If body plans arose as the result of an intelligent agent actualizing an immaterial 
plan or idea, then an extensive series of material precursors to the first animals 
need not exist in the fossil record, anymore than such a series is always present 
in the history of technology. The radio did not evolve gradually from the 
telegraph. Mental plans or concepts need not leave a material trace. Thus, 
intelligent design can account for the dearth of material precursors in the 
Precambrian strata, whereas “bottom-up” materialistic evolutionary theories 
cannot, especially given the failure of the artifact hypothesis discussed earlier. 


Stasis (or Persistent Morphological Isolation) 


Finally, intelligent design also explains the observed stasis in the fossil record. 
As advocates of punctuated equilibrium established, Cambrian species tend to 
persist unchanged in their basic forms over time. Animal body plans that define 
the higher taxa, including classes and phyla, also remain especially stable in their 
basic architectural designs, showing “no directional change”?! over geological 
history after their first appearance in the Cambrian. As a result, the 
morphological disparity between distinct animal body plans remains unbridged. 
Moreover, as noted in Chapter 13, developmental mechanisms constrain the 
degree to which organisms may vary without deleterious consequences. 


PERSISTENT MORPHOLOGICAL ISOLATION 
OF ANIMAL FORMS 


PHYLUM PHYLUM 





CTENCPHORA CHORDATA 
PRESENT 
DISPARITY IN FORM 
CAMBRIAN 





PHYLUM PHYLUM 
"AUTOMOBILIA" "AIRPLANTA" 








FIGURE 18.6 


Comparison of persistent morphological isolation in two animal forms that first arose in the Cambrian 
period, and similar morphological isolation in two technological systems. 


The persistent morphological disparity and isolation of animal body plans is 
completely unexpected by neo-Darwinian, and all other gradualist, evolutionary 
theories, at least. Given such models of evolutionary change, theorists should 
expect the fossil record to exhibit forms of life that grade imperceptibly from 
one to another. Indeed, absent either a compelling version of the artifact 
hypothesis or an adequate mechanism of punctuated large-scale evolutionary 
change, “morphospace” should be mostly filled in. The fossil record should not 
display mostly morphologically disparate or separate forms of animal life. 

Yet, if living systems did arise as the result of intelligent design, then such 
morphological isolation, and such persistent isolation over time, is just what we 
should expect to see—precisely because that is what we do see in the history of 
other intelligently designed systems (see Fig. 18.6). Indeed, experience suggests 


that designed objects have a functional integrity that makes the modification of 
some of their essential parts and their basic organization and architecture 
difficult or impossible. Though the Model-A has been replaced by everything 
from the Yugo to the Honda Accord, the automobile “body plan” with several 
essential functional and/or structural elements has remained unchanged from its 
first appearance in the late nineteenth century. 

Further, despite the design of many innovative variations, automobiles have 
retained their “morphological distance” or structural disparity from other 
functionally distinct technological devices. Indeed, persistent morphological 
disparity in biological systems (manifested as stasis in the fossil record) has a 
direct parallel in our own technology. In biology, what we recognize as different 
organismal body plans are systems that differ fundamentally from each other in 
their overall organization. A crab and a starfish, for example, may exhibit some 
similarities in their low-level protein parts, but they differ fundamentally in their 
digestive and nervous systems and in the overall organization of their organs and 
body parts. In the same way, automobiles and airplanes may have many similar 
parts, even as they differ in the composition of their distinguishing parts and 
overall organization. 

The presence of such structural disparities and isolation among complex 
functionally integrated systems represents another distinctive feature of the 
intelligently designed systems known from our own world of technology. For 
example, the basic technology of the CD-ROM (as employed in audio systems 
and computers) did not “evolve” incrementally from earlier technologies, such 
as magnetic media (e.g., digital tape or disc storage) or analog systems such as 
the once standard long-playing (LP) record. Indeed, it could not. In an analog 
recording, information is stored as three-dimensional microscopic grooves in a 
vinyl surface and is detected mechanically by a diamond stylus. This means of 
storing and detecting information differs fundamentally, as a system, from the 
digitally encoded pits storing data in the silvered surface of a CD-ROM, where 
information is detected optically, not mechanically, by a laser beam. The CD- 
ROM had to be engineered from scratch and, as a result, displays a striking 
structural difference and isolation from other technological devices, even those 
that perform roughly the same function. Although minor new features may 
“accrete” to its basic design architecture, a deep and impassable functional gulf 
separates the CD-ROM as a system from other technological systems. As 
biologist Michael Denton expresses it, “What is true of sentences and watches is 
also true of computer programs, airplane engines, and in fact of all known 


complex systems. Almost invariably, function is restricted to unique and 
fantastically improbable combinations of subsystems, tiny islands of meaning 
lost in an infinite sea of incoherence.”°* In fact, such structural disparity or 
morphological isolation constitutes a diagnostic of designed systems—that is, a 
feature of systems for which only one kind of cause—an intelligent cause—is 
known. 


ACTS OF MIND 


Studies in the history and philosophy of science have shown that to explain an 
event or a set of facts, scientists must typically cite a cause capable of producing 
that event or those facts. When scientists do not have the luxury of directly 
observing the cause of a particular event or effect under study, as historical 
scientists typically do not, they must cite a cause that is otherwise known to 
produce the facts in question. That means historical scientists must show that the 
event or facts of interest must in some way represent the expected outcome of a 
particular cause having acted in the past—that the event or facts should have 
occurred “as a matter of course.” 

To many scientists, especially those steeped in the materialistic assumptions 
of contemporary scientific culture, the idea of intelligent design seems inherently 
implausible or even incoherent. Science to them involves not only observing and 
studying material entities and phenomena, but explaining them by reference to 
materialistic entities. For these scientists, it makes no sense even to consider the 
idea of intelligent design, with its explicit reference to the activity of a designing 
mind. 

Yet it turns out that both the Cambrian animal forms themselves and their 
pattern of appearance in the fossil record exhibit precisely those features that we 
should expect to see if an intelligent cause had acted to produce them (see Fig. 
18.7). Further, the Cambrian animal forms and their manner of appearance 
contradict what we should expect to find in the fossil record and in the animal 
world given a purely materialistic “bottom-up” process of evolution. Thus, 
despite its potential for disturbing the materialistic sensibilities of many 
scientists, it is hard logically to avoid the conclusion that the design hypothesis 
actually provides a better, more casually adequate explanation for key features of 
the Cambrian event. 

When Darwin first acknowledged the problem of the Cambrian fossil record, 
and the small but persistent doubt it raised for him about his theory, his nemesis 
Louis Agassiz not only rejected his theory of evolution, but also affirmed an 
alternative understanding of the nature and origin of animal life. To Agassiz, the 
pattern of animal classification and the fossil record reinforced the idea that 
living forms exemplified basic “types”—ideas that had originated in the mind of 
a designing intelligence. Thus, he would argue that the Cambrian fossils tell of 
“acts of mind.”°° 


As noted in Chapter 1, Darwin himself acknowledged both Agassiz’s 
immense paleontological knowledge and the validity of the problems that 
Agassiz raised. Even so, his affirmation of a positive alternative to Darwin’s 
theory in the form a design hypothesis might well have seemed premature in the 
1860s and certainly did reflect something of the prejudice of the times. But more 
than a century and a half later, after many failed attempts to discover—and 
explain away—the missing fossil ancestors, and after discoveries in molecular 
and developmental biology have revolutionized our understanding of the 
complexity of animal life, continuing to regard the Cambrian explosion as 
merely a niggling problem for established theory—a lone question mark or 
negative clue—now seems not so much cautious, as simply unresponsive to the 
evidence. 
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FIGURE 18.7 
Both the Cambrian animal forms and their pattern of appearance in the fossil 


record exhibit distinctive features or hallmarks of designed systems—features 
that we should expect to see if an intelligence acted to produce them. 


The animal forms that arose in the Cambrian not only did so without any 
clear material antecedent; they came on the scene complete with digital code, 
dynamically expressed integrated circuitry, and multi-layered, hierarchically 
organized information storage and processing systems. 

In light of these marvels and the persistent pattern of the fossil record, should 
we now continue, as Darwin did (who knew nothing of them), to regard the 
Cambrian explosion as just an anomaly? Or may we now consider the features of 
the Cambrian event as evidence supporting another view of the origin of animal 
life? If so, is there now a compelling logic for considering a different kind of 
causal history? 

In fact, there is. The features of the Cambrian event point decisively in 
another direction—not to some as-yet-undiscovered materialistic process that 
merely mimics the powers of a designing mind, but instead to an actual 
intelligent cause. When we encounter objects that manifest any of the key 
features present in the Cambrian animals, or events that exhibit the patterns 
present in the Cambrian fossil record, and we know how these features and 
patterns arose, invariably we find that intelligent design played a causal role in 
their origin. Thus, when we encounter these same features in the Cambrian 
event, we may infer—based upon established cause-and-effect relationships and 
uniformitarian principles—that the same kind of cause operated in the history of 
life. In other words, intelligent design constitutes the best, most causally 
adequate explanation for the origin of information and circuitry necessary to 
build the Cambrian animals. It also provides the best explanation for the top- 
down, explosive, and discontinuous pattern of appearance of the Cambrian 
animals in the fossil record. 


19 


The Rules of Science 


The argument of the previous chapter raises an obvious question. If intelligent 
design provides such a clear and satisfying resolution to the mystery of the 
Cambrian explosion, why have so many brilliant scientists missed it? 

While reflecting on this question, I came across a short story by G. K. 
Chesterton called “The Invisible Man,” which may cast some light on it. In “The 
Invisible Man,” Chesterton tells the story of someone who is murdered in an 
apartment with only one entrance, an entrance watched by four honest men. 
These men insist that during their watch no one entered or left the building. A 
brilliant French detective investigates the case, along with his friend, a dusty 
little Catholic priest. They query the guards, each of whom insists that no one 
entered or exited the building. But then the unimpressive looking priest, Father 
Brown, all but forgotten in the background, pipes up to ask, “Has nobody been 
up and down stairs, then, since the snow began to fall?” 

“Certainly not,” they assure him. 

“Then I wonder what that is?” Father Brown asks, gazing at the white snow 
on the outside entrance stairs. Everyone turns to find a “stringy pattern of grey 
footprints” there. 

“God!” one of them cries, “An invisible man!” 

After asking a few more questions, Father Brown quickly unravels the 
mystery. “When those four quite honest men said that no man had gone into the 
Mansions, they did not really mean that no man had gone into them,” Father 
Brown explains to his detective friend. “They meant no man whom they could 
suspect of being your man. A man did go into the house and did come out of it, 
but they never noticed him.” 

“An invisible man?” 

“A mentally invisible man,” the priest explains. 

What does a mentally invisible man look like? 

“He is dressed rather handsomely in red, blue and gold,” the priest explains, 


“and in this striking, and even showy, costume he entered Himylaya Mansions 
[the name of the apartment complex] under eight human eyes; he killed . . . [the 
murder victim] in cold blood, and came down into the street again carrying the 
dead body. .. . You have not noticed such a man as this.” 

At that moment, he reaches out and puts his hand on “an ordinary passing 
postman,” one who had almost slipped by them unnoticed. 

“Nobody ever notices postmen somehow,” Father Brown muses. “Yet they 
have passions like other men, and even carry large bags where a small corpse 
can be stowed quite easily.”! 

The passing postman, of course, is the murderer. He walked up and down the 
Stairs under the four men’s noses, but because of their mental blinders telling 
them whom to consider and whom to ignore, they overlooked the postman 
entirely. 

The theme is a favorite of detective-story authors: the obvious possibility 
missed by the experts, because their assumptions prevent them from considering 
what might otherwise seem to be an obvious possibility. Could something like 
that be at work in the investigation of the Cambrian explosion? Could 
evolutionary biologists and paleontologists be wearing a set of mental blinders 
that keeps them from considering a possible explanation of the Cambrian 
mystery? 

Odd as it may seem, that is exactly what has been going on in the 
investigation of the Cambrian explosion. In this case, however, those wearing the 
mental blinders have elevated an unwillingness to consider certain explanations 
to a principle of scientific method. That principle is called “methodological 
naturalism” or “methodological materialism.” Methodological naturalism asserts 
that to qualify as scientific, a theory must explain phenomena and events in 
nature—even events such as the origin of the universe and life or phenomena 
such as human consciousness—by reference to strictly material causes. 
According to this principle, scientists may not invoke the activity of a mind or, 
as one philosopher of science puts it, any “creative intelligence.” 

To see how adherence to this principle has prevented scientists from 
considering a possibly true (even “causally adequate”) explanation for the 
Cambrian explosion, let’s revisit the case reported in Chapter 11 of Richard 
Sternberg (see Fig. 19.1), the evolutionary biologist at the Smithsonian’s 
National Museum of Natural History. After Sternberg published my article 
arguing for intelligent design as the best explanation of the Cambrian 
information explosion in the technical journal Proceedings of the Biological 


Society of Washington,’ he suffered professional retribution at the hands of 
Smithsonian administrators. The Biological Society of Washington, the 
governing body that oversees the publication of the journal that Sternberg then 
edited, also issued a public statement repudiating his decision.° Its statement did 
not, however, cite any factual errors in the article or seek to rebut it. Further, the 
president of the society, Smithsonian zoologist Roy McDiarmid, wrote Sternberg 
privately and told him that he (McDiarmid) had reviewed the file containing the 
peer-review reports and had found everything to be in order.® 

What, then, had Sternberg done to deserve public rebuke? 

Sternberg published a paper that violated a presumed rule of science: 
methodological naturalism. Without saying it in so many words, the Biological 
Society made crystal clear that this was the crucial issue. When it distanced itself 
from Sternberg and the review essay, it did not invite a scientific refutation of the 
article, as if the problem had been a misrepresentation or misinterpretation of the 
evidence. Instead, it attempted to settle the issue by releasing a policy statement. 
As a writer in the Wall Street Journal reported at the time, “The Biological 
Society of Washington released a vaguely ecclesiastical statement regretting its 
association with the article. It did not address its arguments but denied its 
orthodoxy, citing a resolution of the American Association for the Advancement 
of Science that defined ID [the theory of intelligent design] as, by its very nature, 
unscientific.”” 





FIGURE 19.1 
Richard Sternberg. Courtesy Laszlo Bencze. 


The Biological Society of Washington “deemed the paper inappropriate for 
the pages of the Proceedings.”® The Society attempted to justify this claim, first, 
on thin procedural grounds, claiming that a paper about the origin of animal 
body plans represented a “departure” from its more typical concern with issues 
of animal classification. Second, and more tellingly, it cited the policy statement 
of the American Association for the Advancement of Science (AAAS) “calling 
upon its members to understand the nature of science” and to recognize “the 
inappropriateness of ‘intelligent design theory’ as subject matter for science 
education.”’ Setting aside the obvious point that my paper was written not as a 
curricular manifesto but as an evidence-based scientific argument, the AAAS 
statement affirmed an implicitly and strictly materialistic understanding of the 
nature of science. It did so to disqualify intelligent design from consideration— 
not only in science education, but in science itself. 

The Sternberg case—like numerous others in which the academic freedom of 
scientists advocating intelligent design has been abridged!°—goes a long way to 
answering the question of why so many otherwise brilliant and knowledgeable 
scientists have overlooked such a seemingly obvious possible answer to the 
Cambrian conundrum. As in Chesterton’s story about the invisible postman, they 
have accepted a self-imposed limitation on the hypotheses they are willing to 
consider. These scientists think they are doing their duty to science. Yet if 
researchers refuse as a matter of principle to consider the design hypothesis, they 
will obviously miss any evidence that happens to support it. And the cultural 
pressure within biology to avoid considering the intelligent design hypothesis 
has long been nontrivial. Francis Crick, for example, famously admonished 
biologists to “constantly keep in mind that what they see was not designed, but 
rather evolved.”!! In 1997, in an article in the New York Review of Books, 
Harvard geneticist Richard Lewontin made explicit a similar commitment to a 
strictly materialistic explanation—whatever the evidence might seem to indicate. 
As he explained in a now often quoted passage: 


We take the side of science in spite of the patent absurdity of some of its 
constructs, in spite of its failure to fulfill many of its extravagant promises 
of health and life, in spite of the tolerance of the scientific community for 
unsubstantiated just-so stories, because we have a prior commitment, a 


commitment to materialism. It is not that the methods and institutions of 
science somehow compel us to accept a material explanation of the 
phenomenal world, but, on the contrary, that we are forced by our a priori 
adherence to material causes to create an apparatus of investigation and a 
set of concepts that produce material explanations, no matter how 
counter-intuitive, no matter how mystifying to the uninitiated. Moreover, 
that materialism is absolute, for we cannot allow a Divine Foot in the 
door!” 


The commitment to methodological naturalism that Lewontin describes, as 
well as the behavior of scientists in cases such as Sternberg’s, leave no doubt that 
many in science simply will not consider the design hypothesis as an explanation 
for the Cambrian explosion or any other event in the history of life, whatever the 
evidence. To do so would be to violate the “rules of science” as they understand 
them. 


BUT IS IT SCIENCE? 


But are these scientists right? Perhaps science must limit itself to purely 
naturalistic or materialistic explanations. If so, are there perhaps good reasons 
for excluding the design hypothesis from consideration as a_ scientific 
hypothesis? Is methodological naturalism the correct policy for science? 

Though scientists routinely assert methodological naturalism as a scientific 
norm, that principle and its exclusion of the design hypothesis have proven 
difficult to justify. To claim that a specific theory does not qualify as scientific 
requires a definition of science or a set of definitional criteria by which to make 
that kind of a judgment. Some philosophers and scientists have asserted that for 
a scientific theory to qualify as scientific, it must meet various criteria of 
testability, falsifiability, observability, repeatability, and the like. Philosophers of 
science call these “demarcation criteria,” because some scientists purport to use 
them to define or “demarcate” science and to distinguish it from pseudo science 
or from other forms of inquiry such as history, religion, or metaphysics.!* 


THE GENERAL PROBLEM OF DEMARCATION 


The demarcation question has long been a vexing one. Historically, scientists and 
philosophers of science have thought that science could be distinguished by its 
especially rigorous method of study. But attempts to define science by reference 
to a distinctive method have proven problematic because different branches and 
types of science use different methods. 

For example, some scientific disciplines distinguish and classify natural 
entities, while others attempt to formulate overarching laws that apply to all 
entities. Some disciplines perform laboratory experiments under controlled and 
replicable conditions, while others attempt to reconstruct or explain singular 
events in the past, often based on field studies of evidence or clues rather than 
laboratory experiments. Some disciplines generate mathematical descriptions of 
natural phenomena without positing mechanisms to explain them. Others look 
for mechanisms or explain law-like regularities by reference to underlying 
mechanisms. Some scientific disciplines make predictions to test theories, while 
others test competing theories by comparing their explanatory power. Some 
disciplines use both these methods, while some conjectures (particularly in 
theoretical physics) may not be testable at all. And on it goes. 

An episode in the history of science illustrates the problem. During the 
seventeenth century, a group of scientists called the “mechanical philosophers” 
insisted, based largely on advances in early chemistry, that scientific theories 
must provide mechanistic explanations. Such explanations had to involve one 
material entity pushing or pulling another. Yet in physics, Isaac Newton (1642— 
1727) formulated an important theory that provided no mechanistic explanation. 
His theory of universal gravitation described mathematically, but did not explain 
in a mechanistic way, the gravitational attraction between planetary bodies— 
bodies separated from each other by miles of empty space with no means of 
mechanical interaction with each other whatsoever.!* Despite provocation from 
the German mathematician Gottfried Wilhelm Leibniz (1646-1716), who 
defended the mechanistic ideal, Newton expressly refused to give any 
explanation, mechanistic or otherwise, for the mysterious “action at a distance” 
that his theory described.!° 

Did that make Newton’s theory unscientific? Strictly speaking, the answer 
depends upon which definition of science someone chooses to apply. Today one 
would be hard-pressed to find anybody who denies that Newton’s famous theory 


qualified as scientific. Yet we could easily find scientists still willing to say 
scientific theories must provide mechanisms as well as others who would deny 
as much. 

And that is the problem. If scientists and philosophers of science do not have 
an agreed-upon definition of science, how can they settle questions about which 
theories do and do not qualify as scientific? If scientists lack such a definition, 
it’s difficult to argue that any particular theory is unscientific by definition. For 
this reason, philosophers of science, the scholars who study the nature and 
definition of science, now almost universally reject the use of demarcation 
arguments to decide the validity of theories or settle competition between 
them.'° They increasingly regard demarcation as an essentially semantic 
question and nothing more. Is theory X scientific or not? Answer: that depends 
upon which definition of science is used to decide the question. 

Moreover, as the philosopher of science Larry Laudan has shown in a seminal 
article, “The Demise of the Demarcation Problem,” attempts to apply 
demarcation criteria to decide the scientific status of specific theories have 
invariably generated irreconcilable contradictions.'” The vortex theory of gravity 
that Newton’s theory replaced envisioned planets swirling around the sun pushed 
by a substance called ether.!® It did provide a mechanistic explanation for 
gravitational attraction. It failed, however, to explain the evidence and was 
judged by Newton and physicists following him to be manifestly false. 
Nevertheless, because it proposed a mechanistic cause of gravitation, it qualified 
as “scientific’”—at least given the conception of science favored by Leibniz and 
the mechanical philosophers.!? Conversely, Newton’s theory failed to as 
scientific by their definition, though it much more accurately fit the evidence. 

Such contradictions have long beset the whole enterprise of demarcation. 
Theories that scientists have rejected as false because of their inability to explain 
or describe the evidence often meet the very criteria or methodological features 
(testability, falsifiability, repeatability, observability, etc.) that allegedly 
characterize true science. On the other hand, many highly esteemed or successful 
theories often lack allegedly necessary features of genuine science. 

Thus, philosophers of science generally think it much more important to 
assess whether a theory is true, or whether the evidence supports it, than whether 
it should or should not be classified as “science.” The question of whether a 
theory is “scientific” is really a red herring. What we really want to know is 
whether a theory is true or false, supported by the evidence or not, worthy of our 
belief or not. And we cannot decide those questions by applying a set of abstract 


criteria that purport to tell in advance what all good scientific theories must look 
like.*° 


DEFINE AND DISMISS: DEMARCATION ARGUMENTS AGAINST 
INTELLIGENT DESIGN 


The rejection of demarcation arguments among philosophers of science has not 
stopped critics of intelligent design from attempting to settle debates about 
biological origins by the expedient of formulating such arguments against 
intelligent design. Some use these arguments to justify methodological 
naturalism (which has the same effect). 

Advocates of methodological naturalism have argued that the theory of 
intelligent design is inherently unscientific for some, or all, of the following 
reasons: (a) is not testable,*! (b) is not falsifiable,2* (c) does not make 
predictions,~° (d) does not describe repeatable phenomena, (e) does not explain 
by reference to natural law,** (f) does not cite a mechanism,7° (g) does not make 
tentative claims,*° and (h) has no problem-solving capability.*” They have also 
claimed that it is not science because it (i) refers to an unobservable entity.7® 
These critics also assume, imply, or assert that materialistic evolutionary theories 
do meet such criteria of proper scientific method. 

Readers may wish to consult Signature in the Cell for a more detailed 
response to these specific arguments. There I show that many of these claims are 
simply false (e.g., contrary to the claims of its critics intelligent design is 
testable; it does make predictions; it does formulate its claims tentatively; and it 
does have scientific problem-solving capability). But I also show that when the 
claims of those making demarcation arguments are true—when intelligent design 
doesn’t meet a specific criterion—that fact does not provide good reason for 
excluding intelligent design from consideration as a scientific theory. Why? 
Because the materialistic evolutionary theories that intelligent design challenges, 
theories widely regarded by convention as “scientific,” fail to meet the very same 
demarcation standard. In other words, there is no defensible definition of 
science, and no specific demarcation criterion, that justifies both excluding 
intelligent design from science and including competing materialistic 
evolutionary theories. Instead, attempts to use demarcation criteria specifically 
to disqualify intelligent design as a scientific theory have repeatedly failed to 
differentiate the scientific status of intelligent design from that of competing 
theories. Depending upon which criteria are used to adjudicate their scientific 


status, and provided metaphysically neutral criteria are selected to make such 
assessments, intelligent design and materialistic origins theories invariably prove 
equally scientific or unscientific. 

For example, some critics of intelligent design have argued that it fails to 
qualify as a scientific theory because it makes reference to an unseen or 
unobservable entity, namely, a designing mind in the remote past. Yet many 
accepted theories—theories assumed to be scientific—postulate unobservable 
events and entities. Physicists postulate forces, fields, and quarks; biochemists 
infer submicroscopic structures; psychologists discuss their patients’ mental 
states. Evolutionary biologists themselves infer unobserved past mutations and 
invoke the existence of extinct organisms and transitional forms for which no 
fossils remain. Such things, like the actions of an intelligent designer, are 
inferred from observable evidence in the present, because of the explanatory 
power they may offer. 

If the demarcation criterion of observability is applied rigidly, then both 
intelligent design and materialistic theories of evolution fail to qualify as 
scientific. If the standard is applied more liberally (or realistically)— 
acknowledging the way in which historical scientific theories often infer 
unobservable past events, causes, or entities—then both theories qualify as 
scientific. 

And so it goes with other such criteria as well. There is no specific (non- 
question-begging) demarcation criterion that succeeds in disqualifying the theory 
of intelligent design from consideration as a scientific theory without also doing 
the same to its materialistic rivals. 


REASONS TO REGARD INTELLIGENT DESIGN AS A SCIENTIFIC 
THEORY 


Demarcation arguments fail to justify excluding intelligent design from science. 
But it turns out that there are some good—if convention-dependent—reasons to 
regard intelligent design as a scientific theory. 

For example, many scientists and philosophers of science regard testability as 
an important feature of scientific inquiry. And intelligent design is testable in 
three specific and interrelated ways. First, like other scientific theories concerned 
with explaining events in the remote past, intelligent design is testable by 
comparing its explanatory power with that of competing theories. Second, 
intelligent design, like other historical scientific theories, is tested against our 


knowledge of the cause-and-effect structure of the world. As we have discussed, 
historical scientific theories provide adequate explanations when they cite causes 
that are known to produce the effects in question or “causes now in operation.”7? 
Because of this, the plausibility of historical scientific theories, including 
intelligent design, can be tested by reference to independent knowledge of cause- 
and-effect relationships. Third, although historical scientific theories typically 
cannot be tested under controlled laboratory conditions, they do sometimes 
generate predictions that enable scientists to compare their merit to that of other 
theories. Intelligent design has generated a number of specific empirical 
predictions that distinguish it from competing evolutionary theories and that 
serve to confirm the design hypothesis over its competitors. (In Signature in the 
Cell, I described ten such predictions that the theory of intelligent design has 
generated).°0 

There is another compelling, if convention-dependent, reason to regard 
intelligent design as a scientific theory. The inference to intelligent design is 
based upon the same method of historical scientific reasoning and the same 
uniformitarian principles that Charles Darwin used in On the Origin of Species. 
The similarity in logical structure runs quite deep. Both the argument for 
intelligent design and the Darwinian argument for descent with modification 
were formulated as abductive inferences to the best explanation. Both theories 
address characteristically historical questions; both employ typically historical 
forms of explanation and testing; and both have metaphysical implications. 
Insofar as we regard Darwin’s theory as a scientific theory, it seems appropriate 
to designate the theory of intelligent design as a scientific theory as well. 

Indeed, neo-Darwinism and the theory of intelligent design are not two 
different kinds of inquiry, as some critics have asserted. They are two different 
answers—formulated using a similar logic and method of reasoning—to the 
same question: “What caused biological forms and the appearance of design to 
arise in the history of life?” It stands to reason that if we regard one theory, neo- 
Darwinism or intelligent design, as scientific, we should regard the other as the 
same. Of course, whether either theory is true or not is another matter. An idea 
may be scientific and incorrect. In the history of science, many theories have 
proven to be so. The vortex theory of gravity, to which I referred earlier, would 
be one of nearly countless illustrations. 

For readers who would like to consider more detailed responses to arguments 
about whether intelligent design qualifies as “science,” I recommend Chapters 
18 and 19 in Signature in the Cell.*! In Signature, I respond in detail to other 


philosophical objections to the case for intelligent design. These include 
challenges such as: (a) intelligent design is religion, not science,’* (b) the case 
for intelligent design is based on flawed analogical reasoning, (c) intelligent 
design is a fallacious argument from ignorance, sometimes called the “God of 
the Gaps” objection, (d) intelligent design is a science stopper, (e) the famous 
zinger, popularized by Richard Dawkins, that asks “Who designed the 
designer?”*? and many others. 


ANEW OBJECTION TO THE SCIENTIFIC STATUS OF INTELLIGENT 
DESIGN 


Since the publication of Signature in the Cell, Robert Asher, a University of 
Cambridge paleontologist, has offered another reason to contest my 
characterization of intelligent design as a scientific theory. In his book, Evolution 
and Belief, he challenges my claim to have used the uniformitarian method of 
Lyell and Darwin to develop the case for intelligent design. Since his objection is 
new, published only in 2012 by Cambridge University Press, it deserves 
discussion. 

Asher characterizes my thinking as follows: “The processes we know and 
observe today are relevant to explaining the phenomena of the past, and we 
know that particularly complicated things we see today have an intelligence 
behind them.”°* He notes that I argue certain complex technologies, such as 
computer software, have “only one source: human ingenuity.”°° It follows, 
according to Asher’s paraphrase of my argument that “a similarly complex 
device we observe in the geological past must also have arisen as a result of 
something like human ingenuity, i.e., intelligence.”°° 

Asher doesn’t seem to understand the importance of specified information, as 
opposed to “complicated things,” as a key indicator of design. That aside, he 
does claim to recognize the role of uniformitarian principles of reasoning in my 
argument for intelligent design. In spite of this, Asher elsewhere disputes that I 
employ the uniformitarian method of reasoning. Why? According to Asher, the 
inference to intelligent design is actually “anti-uniformitarian” because it doesn’t 
provide a “mechanism.” As he puts it, “by attempting to replace a causal 
mechanism (natural selection) with an attribution of agency (design), ID 
advocates such as Meyer are decidedly anti-uniformitarian. What process of 
today could possibly lead to his understanding of the past?”?’ 

The answer to Asher’s question seems pretty obvious. The answer is: 


intelligence. Conscious activity. The deliberate choice of a rational agent. 
Indeed, we have abundant experience in the present of intelligent agents 
generating specified information. Our experience of the causal powers of 
intelligent agents—of “conscious activity” as “a cause now in operation”— 
provides a basis for making inferences about the best explanation of the origin of 
biological information in the past. In other words, our experience of the cause- 
and-effect structure of the world—specifically the cause known to produce large 
amounts of specified information in the present—provides a basis for 
understanding what likely caused large increases in specified information in 
living systems in the past. It is precisely my reliance on such experience that 
makes possible an understanding of the type of causes at work in the history of 
life. It also makes my argument decidedly uniformitarian—not “anti- 
uniformitarian”—in character. 

Asher confuses the uniformitarian imperative in historical scientific 
explanations (the need to cite a presently known or adequate cause) with a 
demand for citing a material cause, or mechanism. The theory of intelligent 
design does cite a cause, and indeed one known to produce the effects in 
question, but it does not necessarily cite a mechanistic or materialistic cause. 
Proponents of intelligent design may conceive of intelligence as a strictly 
materialistic phenomenon, something reducible to the neurochemistry of a brain, 
but they may also conceive of it as part of a mental reality that is irreducible to 
brain chemistry or any other physical process. They may also understand and 
define intelligence by reference to their own introspective experience of rational 
consciousness and take no particular position on the mind-brain question. 

Asher assumes that intelligent design denies a materialistic or “physicalist” 
account of the mind (as I personally do, in fact) and rejects it as unscientific on 
that basis. But he offers no noncircular reason for making that judgment. He 
cannot say that the principle of methodological naturalism requires that all 
genuinely scientific theories invoke only mechanistic causes, because the 
principle of methodological naturalism itself needs justification. And asserting 
that “all genuinely scientific theories must provide mechanisms” is just to restate 
the principle of methodological naturalism in different words. Indeed, to say that 
all scientific explanations must provide a mechanism is equivalent to saying that 
they must cite materialistic causes—precisely what the principle of 
methodological naturalism asserts. Asher seems to be assuming without 
justification that all scientifically acceptable causes are mechanistic or 
materialistic. His argument thus assumes a key point at issue, which is whether 


there are independent—that is, metaphysically neutral—reasons for requiring 
historical scientific theories to cite materialistic causes in their explanations as 
opposed to explanations that invoke possibly immaterial entities such as creative 
intelligence, mind, mental action, agency, or intelligent design. 

In any case, he confuses the logical requirement of citing a vera causa, a true 
or known cause, with an arbitrary requirement to cite only materialistic causes. 
He confuses uniformitarianism with methodological naturalism.°° He then 
critiques my design argument for rejecting the former, though it only rejects the 
latter. In so doing, he imposes an additional requirement on explanations of past 
events that leads him to mistake my argument as anti-uniformitarian and to miss 
the evidence for intelligent design. His implicit commitment to methodological 
naturalism makes the evidence for intelligent design—‘“the postman,” as it were 
—mentally invisible to him. 

Nevertheless, the concern that he raises about the theory of intelligent design 
not citing a mechanism still troubles people. In fact, I frequently get questions 
about this issue. People will ask something like this: “I can see your point about 
digital code providing evidence for intelligent design, but how exactly did the 
designing intelligence generate that information or arrange matter to form cells 
or animals?” Or: “How did the intelligent designer that you infer impress its 
ideas on matter to form animals?” As Asher puts it, “How could a biological 
phenomenon, even if designed, be simply willed into existence without an actual 
mechanism?”°? 

To help clear things up, several points need to be considered. First, the theory 
of intelligent design does not provide a mechanistic account of the origin of 
biological information or form, nor does it attempt to. Instead, it offers an 
alternative causal explanation involving a mental, rather than a necessarily or 
exclusively material, cause for the origin of that reality. It attributes the origin of 
information in living organisms to thought, to the rational activity of a mind, not 
a strictly material process or mechanism. That does not make it deficient as a 
materialistic or mechanistic explanation. It makes it an alternative to that kind of 
explanation. Advocates of intelligent design do not propose intelligent causes 
because they cannot think of a possible mechanistic explanation for the origin of 
form or information. They propose intelligent design because they think it 
provides a better, more causally adequate explanation for these realities. Given 
what we know from experience about the origin of information, materialistic 
explanations are the deficient ones. 

There is a different context in which someone might want to ask about a 


mechanism. He or she may wish to know by what means the information, once 
originated, is transmitted to the world of matter. In our experience, intelligent 
agents, after generating information, often use material means to transmit that 
information. A teacher may write on a chalkboard with a piece of chalk or an 
ancient scribe may have chiseled an inscription in a piece of rock with a metal 
implement. Often, those who want to know about the mechanism of intelligent 
design are not necessarily challenging the idea that information ultimately 
originates in thought. They want to know how, or by what material means, the 
intelligent agent responsible for the information in living systems transmitted 
that information to a material entity such as a strand of DNA. To use a term from 
philosophy, they want to know about “the efficient cause” at work. 

The answer is: We simply don’t know. We don’t have enough evidence or 
information about what happened, in the Cambrian explosion or other events in 
the history of life, to answer questions about what exactly happened, even 
though we can establish from the clues left behind that an intelligent designer 
played a causal role in the origin of living forms. 

An illustration from archaeology helps explain how this can be so (see Fig. 
19.2). Years ago explorers of a remote island in the southwestern Pacific Ocean 
discovered a group of enormous stone figures. The figures displayed the 
distinctive shape of human faces. These figures left no doubt as to their ultimate 
origin in thought. Nevertheless, archeologists still don’t know the exact means 
by which they were carved or erected. The ancient head carvers might have used 
metallic hammers, rock chisels, or lasers for that matter. Though archaeologists 
lack the evidence to decide between various hypotheses about how the figures 
were constructed, they can still definitely infer that intelligent agents made them. 
In the same way, we can infer that an intelligence played a causal role in the 
origin of the Cambrian animals, even if we cannot decide what material means, 
if any, the designing intelligence used to transmit the information, or shape 
matter, or impart its design ideas to living form. Although the theory of 
intelligent design infers that an intelligent cause played a role in shaping life’s 
history, it does not say how the intelligent cause affected matter. Nor does it have 
to do so. 





FIGURE 19.2 
Group of carvings of giant heads, called “Moais,” on Easter Island. Courtesy 
iStockphoto/Think-stock. 


There is a logical reason we cannot without further information determine the 
mechanism or means by which the intelligent agent responsible for life 
transmitted its design to matter. We can infer an intelligent cause from certain 
features of the physical world, because intelligence is known to be a necessary 
cause, the only known cause, of those features. That allows us to infer 
intelligence retrospectively as a cause by observing its distinctive effects. 
Nevertheless, we cannot establish a unique scenario describing how the 
intelligent agent responsible for life arranged or impressed its ideas on matter, 
because there are many different possible means by which an idea in the mind of 
an intelligent agent could be transmitted or instantiated in the physical world. 

There is another even more profound reason that intelligent design—indeed, 
science itself—may not be able to offer a completely mechanistic account of the 
instantiation of thought into matter. Robert Asher worries about how “a 
biological phenomenon, even if designed,” could be “simply willed into 
existence without an actual mechanism.” In Asher’s understanding, the 
uniformitarian principle asks for a precedent, a known cause that not only 
generates information, but translates immaterial thought into material reality, 
impressing itself on and shaping the physical world. Asher complains that the 
argument for intelligent design cannot cite such a precedent and is thus “anti- 


uniformitarian.” 

Yet a precedent comes very readily to mind, an intimately familiar one for us 
all. At present no one has any idea how our thoughts—the decisions and choices 
that occur in our conscious minds—affect our material brains, nerves, and 
muscles, going on to instantiate our will in the material world of objects. 
However, we know that is exactly what our thoughts do. We have no 
mechanistic explanation for the mystery of consciousness, nor what is called the 
“mind-body problem”—the enigma of how thought affects the material state of 
our brains, bodies, and the world that we affect with them. Yet there is no doubt 
that we can—as the result of events in our conscious minds called decisions or 
choices—“will into existence” information-rich arrangements of matter or 
otherwise affect material states in the world. Professor Asher did this when he 
wrote the chapter in his bbok—representing his ideas impressed as words onto a 
material object, a printed page—attempting to refute intelligent design. I am 
doing this right now. This example, representative of countless daily experiences 
in life, surely satisfies the demands of uniformitarianism. 

Though neuroscience can give no mechanistic explanation for consciousness 
or the mind-body problem, we also know that we can recognize the product of 
thought, the effect of intelligent design, in its distinctive information-rich 
manifestations. Professor Asher recognized evidence of thought when he read 
the text in my book; I did so when I read his; you are doing so right now. Thus, 
even though it remains entirely possible that we may never know how minds 
affect matter and, therefore, that there may always be a gap in our attempt to 
account for how a designing mind affected the material out of which living 
systems were formed, it does not follow that we cannot recognize evidence of 
the activity of mind in living systems. 


WHY IT MATTERS FOR SCIENCE 


But if proponents of intelligent design admit that they do not, or perhaps even 
cannot, answer the question of how the mind responsible for the design of 
animal life impressed its ideas on matter, why does it matter that we recognize 
the evidence for intelligent design at all? If intelligent design just replaces one 
mystery with another, why not limit ourselves to materialistic explanations after 
all, as methodological naturalism requires, and be content with accepting the 
mystery we already have? Wouldn’t that be simpler and more intellectually 
economical? 

Perhaps. But it puts the mystery in the wrong place. We do know of a cause 
that can produce the functional information necessary to build complex systems. 
But we do not know exactly how mind interfaces with matter. If we were to ask 
what caused the Rosetta Stone to arise, and then insist despite all evidence to the 
contrary that a purely material process is capable of producing the information- 
rich etchings on that stone, we would be deluding ourselves. The information 
etched into that black slab of igneous rock at the British Museum provides 
overwhelming evidence that an intelligent agent did cause those inscriptions. 
Any rule that prevents us from considering such an explanation diminishes the 
rationality of science, because it prevents scientists from considering a possibly 
—and in this case obviously—true explanation. And the truth matters, not least 
in science. For this reason, the “rules of science” should not commit us to 
rejecting possibly true theories before we even consider the evidence. But that is 
exactly what methodological naturalism does. 

Moreover, adhering to methodological naturalism and refusing to consider the 
evidence for intelligent design in life does not just affect the explanations that we 
are willing to consider for the origins and history of life. They also affect the 
questions we ask about life as it exists, and thus the entire biological research 
agenda that we pursue. 

An analogy to a human artifact again shows why. If we ask exactly how the 
scribe responsible for the inscriptions on the Rosetta Stone accomplished his or 
her task—with a metal chisel, a sharpened piece of obsidian, a diamond stylus, 
or some other material means—we may not have enough evidence to answer that 
question. Nevertheless, it will help archaeologists to know that they are looking 
at an artifact of intelligence, rather than a byproduct of strictly natural processes. 
This will lead them to ask other more relevant questions about the stone, such as: 


“What do the inscriptions mean?” “Who wrote them?” and “What do they tell us 
about the surrounding cultures at the time?” In a similar way, what we think 
about how animal life arose and developed will lead us to ask different questions 
about living forms—dquestions that we might never think to ask if we were 
assuming that they had arisen by a purely undirected mechanism such as natural 
selection. 

Intelligent agents and natural selection do their work very differently. The 
mutation and selection mechanism is a blind, trial-and-error process, one that 
must maintain or optimize functional advantage through a series of incremental 
steps. Given Darwinian assumptions, we would not expect to see structures or 
systems in living organisms that required foresight. Nor would we expect to see 
structures that needed to be produced all at once in large jumps rather than by a 
series of function-preserving incremental steps. We would, however, expect to 
see evidence of a trial-and-error process in the genomes of organisms. 

But what happens if we open ourselves to the possibility of detecting design 
in life? We know a lot about how intelligent designers do their work. Intelligent 
designers use many established design strategies (or “design patterns,” as 
engineers would say). They also have foresight that allows them to reach 
functional goals without the need to maintain function through a series of 
intermediate structures. They typically engineer new systems from scratch 
without relying on random, incremental, trial-and-error modifications in one 
system to produce another. 

Because these two different types of causes operate differently and often 
produce different types of structures and systems, scientists should expect living 
systems (and the history of life) to look differently depending upon which type 
of cause produced the organisms or structures in question. And these differing 
perspectives and expectations can lead scientists to ask different research 
questions and make different predictions about what we should find in the 
structure of life itself. 


THE ENCODE PROJECT AND AN ID PREDICTION 


In 2012, a dramatic confirmation of one such prediction made by advocates of 
intelligent design occurred in the field of genomics. Three leading science 
journals, Nature, Genome Research, and Genome Biology, published a series of 
groundbreaking papers reporting on the results of a massive study of the human 
genome called the ENCODE project (short for Encyclopedia of DNA 


Elements).4° The conclusion: at least 80 percent of the genome performs 
significant biological functions, “dispatching the widely held view that the 
human genome is mostly ‘junk DNA.’ ”4! 

The discovery challenged a long held neo-Darwinian interpretation of the 
genome. According to neo-Darwinism, the genome as a whole should display 
evidence of the random trial-and-error process that gave rise to new genetic 
information. The discovery in the 1970s that only a small percentage of the 
genome contains information for building proteins was hailed at the time as 
powerful confirmation of the Darwinian view of life. The noncoding regions of 
the genome were assumed to be nonfunctional detritus of the trial-and-error 
mutational process—the same process that produced the functional code in the 
genome. As a result, these noncoding regions were deemed “junk DNA,” 
including by no less a scientific luminary than Francis Crick.‘ 

Because intelligent design asserts that an intelligent cause produced the 
genome, design advocates have long predicted that most of the nonprotein- 
coding sequences in the genome should perform some biological function, even 
if they do not direct protein synthesis. Design theorists do not deny that 
mutational processes might have degraded some previously functional DNA, but 
we have predicted that the functional DNA (the signal) should dwarf the 
nonfunctional DNA (the noise), and not the reverse. As William Dembski, a 
leading design proponent, predicted in 1998, “On an evolutionary view we 
expect a lot of useless DNA. If, on the other hand, organisms are designed, we 
expect DNA, as much as possible, to exhibit function.”*° 

The ENCODE project and other recent research in genomics have confirmed 
this prediction. As the lead article in Nature reported, ENCODE has “enabled us 
to assign biochemical functions for 80 percent of the genome in particular 
outside of the well-studied protein-coding regions.”“+ Other research in 
genomics has shown that, overall, the noncoding regions of the genome function 
much like the operating system in a computer. Indeed, the noncoding regions of 
the genome direct the timing and regulate the expression of the data modules or 
coding regions of the genome, in addition to possessing myriad other 
functions.*° Before ENCODE, neo-Darwinists would often ask: If the 
information in DNA provides such compelling evidence for the activity of a 
designing intelligence, why is over 90 percent of the genome composed of 
functionless nonsense sequences? The latest genomics research now provides a 
ready answer to this question: it isn’t. 

The significance of these discoveries in genomics to the debate about design 


has passed largely unnoticed in the media. But repeated attempts to stigmatize 
the ENCODE researchers as aiding and abetting “intelligent design creationists” 
have inadvertently highlighted what is at stake. In this effort, a biochemist at the 
University of Toronto, Laurence A. Moran, emerged as point man. The Moran 
strategy centered on tarring scientists and science journalists who publicized 
ENCODE and its implications with the brush of “Intelligent Design 
Creationism”—an all too familiar conflation of intelligent design with a very 
different idea, the biblical literalism of young-earth creationism. When the 
distinguished journal Science selected ENCODE as one of the top ten science 
news stories of 2012, reminding readers that it had detonated the notion of junk 
DNA by revealing overwhelming functionality in the genome,*° Moran jeered, 
“Oh well, I guess I’ll just have to be content to point out that many scientists are 
as stupid as many Intelligent Design Creationists!”*’ In the science world, as in 
the media, “creationist” is a dirty word; it’s like calling someone a Communist 
used to be in the 1950s. Such attempts to stigmatize results that challenge a 
favored theory illustrate how an ideological monopoly in science can stifle 
inquiry and discussion. 

The demise of the idea of junk DNA illustrates too, in a more positive way, 
how a competing perspective can inspire research that contributes to new 
discovery. Although clearly not every scientist who performed research helping 
to establish the functional significance of nonprotein-coding DNA was inspired 
by the theory of intelligent design, at least one noteworthy scientist was. During 
the early part of the decade, before ENCODE made the headlines, this scientist 
published many articles challenging the idea of junk DNA based on genomics 
research that he was conducting at the National Institutes of Health. After the 
publication of ENCODE in 2012, his coauthor on many of those articles, the 
prominent University of Chicago geneticist James Shapiro, wrote an article in 
the Huffington Post commending the scientist for his groundbreaking research 
and for anticipating the ENCODE results years before. In the article, Shapiro 
acknowledged that he and his coauthor had “different evolutionary 
philosophies”—his charitable way of referring to his coauthor’s growing interest 
in the theory of intelligent design. 

Who was that other scientist? None other than Richard Sternberg, the 
evolutionary biologist who was punished for his openness to intelligent design 
while serving at the Smithsonian Institution (and the National Institutes of 
Health) in 2004. Around that time, Sternberg’s doubts about neo-Darwinism and 
his growing interest in intelligent design led him to consider the possibility that 


the majority of the genome could really be functional.*° His research 
subsequently confirmed what was for him, an idea inspired in part by intelligent 
design. 

In Signature in the Cell, I described many other discriminating predictions of 
the theory of intelligent design—predictions that differ from those of competing 
materialistic evolutionary theories—and how those predictions might help to 
guide new research in various subdisciplines of biology, including some in 
medicine. These predictions may also lead scientists to make new discoveries— 
discoveries that proponents of a competing perspective might not have been 
inclined to make—or to accept. 


OPEN VISTAS 


By now it should be clear why so many brilliant scientists have missed the 
evidence of design in the Cambrian explosion. Scott Todd, a biologist writing in 
Nature, succinctly stated the reason: “Even if all the data point to an intelligent 
designer, such a hypothesis is excluded from science because it is not 
naturalistic.”*? When scientists decide by fiat that intelligent design lies beyond 
the bounds of science, their decision will prevent them from considering this 
possibly or probably true explanation for the origin of animal form. But it will 
also deprive them of a new perspective that can generate new research questions 
and foster new avenues of discovery. Knowing this helps solve the final mystery 
of this book, but it also suggests a more productive way forward to address 
mysteries yet unsolved. Scientists committed to methodological naturalism have 
nothing to lose but their chains—fetters that bind them to a creaky and exhausted 
nineteenth-century materialism. The future lies open before them, and us. As we 
in the intelligent design research community like to say, let’s break some rules 
and follow the evidence wherever it leads. 


20 
What’s at Stake 


In the summer of 2002, I had the opportunity to hike up to the Burgess Shale 
with a group of geologists, geophysicists, and marine biologists. Our group also 
included my then eleven-year-old son and a teenage friend of his who was 
interested in the Cambrian fossils and the debate about Darwinism and design. 

When we got to the top of the mountain, I was unprepared for the impact the 
fossils would have on me. I had seen many fossils before, of course. But seeing 
these fossils—marine animals from the dawn of animal life at the top of a 
mountain with their beautifully preserved appendages and organs—rendered the 
idea of the “Cambrian explosion” a good deal less theoretical for me than it had 
been. These complex sea creatures, now brushed by the thin air at an elevation of 
7,000 feet in the middle of the Canadian Rockies, had apparently arisen 
suddenly, almost from nothing by way of ancestral forms, in the sedimentary 
record. Everything about them cried out for a story—a big story. It set my mind 
and imagination racing (see Fig. 20.1). 

As wonderful as the fossils were, our trip to see them was made more 
memorable by two things that happened en route—one on the way up the 
mountain and one on the way down. As we were making our ascent, crossing a 
large talus slope—a section of the mountain void of vegetation and covered with 
only fragments of sedimentary rock—I heard my son unexpectedly call out to 
me from up at the front of our group. His voice had a trembling quality. I looked 
forward to see him, normally a fearless kid blessed by energy without bound, 
standing locked in place, pale and wide-eyed. I stepped around several of the 
other hikers on the trail to catch up with him. It turned out he was experiencing a 
kind of vertigo, though the mountain was not dangerously steep at that point (see 
Fig. 20.2). As he set out across the path that cut through the rocky slope, he had 
made the mistake of looking down the mountain. Without trees as a reference 
point and with hundreds of feet of loose rock fragments above and beneath him, 
he became disoriented and frightened. I steadied him as we walked in step, stride 


for stride, with me directly behind him across that open stretch of the mountain. 
Before long we were back to a place on the trail where trees and other plants 
appeared, providing a steadying presence as a point of reference. My son’s 
perspective quickly returned. He relaxed and soon was smiling and leaping 
confidently ahead of me again. 

On the way down the mountain, I had a striking interaction with a member of 
our group, who gave voice to a different kind of disorientation. It began as a 
conversation between my son’s friend and our official field guide, who had been 
assigned to us by the local Burgess Shale Geoscience Foundation. Our guide was 
a paleontologist and did a terrific job. He told many a fine story about the 
geological history of the surrounding formations, about the discovery of the 
fossils, and of course about the evolutionary history of animal life. In fact, just 
before we turned the final comer on the trail to ascend to a large collection of 
excellent fossils available for viewing at the top of the mountain, he slipped a 
statement of support for evolutionary orthodoxy into his description of the fossil 
site. Our guide was clearly unaware that many of us in the group knew the 
fossils we were about to see challenged the standard Darwinian story. 











FIGURE 20.1 
A trilobite fossil found at the Burgess Shale. Courtesy Michael Melford/National 
Geographic Image Collection/Getty Images. 
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FIGURE 20.2 
Figure 20.a (left): Photograph of the author and his son Jamie at the Walcott 
Quarry of the Burgess Shale in British Columbia, Canada. Figure 20.2b (right): 
Pausing for a moment to reflect on the slope below the Burgess Shale outcrop. 


We hadn’t made an issue of our views, of course, but nearly all the scientists 
on the hike were skeptical of neo-Darwinism. Paul Chien, the University of San 
Francisco marine biologist who had worked with J. Y. Chen in China on the 
sponge embryo fossils, was on the trip, and he had more than a passing 
acquaintance with Cambrian-era paleontology—as did several of the Canadian 
geologists with us. Still, not wanting to introduce any needless discord, we 
carefully avoided engaging the issue. We just wanted to see the fossils. 

As we descended the mountain, however, my son’s young friend asked our 
guide how he squared what we had just seen with his support for Darwinian 
evolution. The guide at first maintained his commitment to the Darwinian party 
line. He said he thought that Darwin would feel “vindicated” by the discovery of 
the Burgess fossils. This proved too much for the precocious, intellectual 
teenager, who loudly blurted out, “What?! Darwin would feel vindicated? By the 
sudden appearance of all those animals without any ancestors in the fossil 
record! Are you kidding?!” 

You would have to know this endearing young man to understand how his 
uninhibited outburst only charmed and amused our guide. But fortunately it did. 
The rest of us, however, were initially mortified. This was the discussion we 


were trying to avoid, knowing exactly the intense emotions it often provokes. 
With scientists, it is generally safer to discuss religion and politics. 

Nevertheless, to his credit, our guide took the challenge in stride. He 
explained how the Burgess fossils demonstrated evidence of change over time, 
how the rock column showed the great age of the earth, and how the discovery 
of the fossils high on a mountain revealed the evolution of the planet. Our young 
friend had spent too much time reading up on the subject to let the point go at 
that. He brushed aside the issue of the earth’s age, which like our guide he 
reckoned in billions of years, and assured the man that he accepted evidence for 
change over time in the sedimentary record. He didn’t question evolution in that 
sense. He questioned Darwinian evolution. “Where is the evidence of gradual 
change?” he demanded, as his teenage voice cracked with excitement as it 
moved into the upper register. He continued, “What mechanism could produce 
so many new animals so quickly?” 

An odd thing happened then. The paleontologist now leading us down the 
trail suddenly ceased to act the part of “guide.” He dropped any pretense of 
superior authority and said, “You know, I’ve wondered about that myself.” I 
thought I could hear in his voice the candid amazement of the fourteen-year-old 
boy he once was. 

“How do you explain it?” he asked my son’s friend. 

Our young spokesman confidently piped up and asserted, “Intelligent design, 
of course!” 

At which point, our guide began to ask the probing questions. Soon my son’s 
friend had exhausted his store of knowledge and began to look to me to join the 
conversation. I did so, reluctantly at first. I explained the information argument 
for intelligent design and how the Cambrian explosion contributed to it. Our 
guide asked me the hard questions: How can we detect design? Is intelligent 
design science? Aren’t we just arguing from ignorance and giving up on science, 
or at least mainstream evolutionary science, too soon? He also wanted to know 
who I personally thought the designer was. His challenges were tough and 
honest. A terrific conversation ensued. 

When we reached the trailhead, he surprised me, thanking me for the 
conversation—and thanking my young friend for starting it. He then spoke a bit 
more personally and revealed that he sometimes found thinking about biological 
origins disturbing. He said that as a scientist he was committed to the 
evolutionary perspective. But he also found its denial of purpose depressing. He 
wondered if there was some way to affirm both science and the kind of purpose 


and meaning in life that religion speaks about. As we parted ways, he said he 
would like to learn more about intelligent design. He told me he was intrigued 
by the perspective we were developing. I felt that we had made a genuine human 
connection rather than, as sometimes happens in the evolution debate, merely 
flinging assertions at one another. 

Over the years, as I’ve researched and thought about biological origins, I 
have had numerous similar conversations with people of many persuasions and 
backgrounds: religious and nonreligious; scientists, engineers, medical doctors; 
businessmen and -women, appliance repairmen and taxi drivers. These 
conversations usually start innocently enough as the result of someone asking 
me what I do for a living. Though I often euphemize my response (“I work for a 
research organization”) to avoid getting trapped in a heavy conversation on an 
airplane or over a broken dishwasher, often the conversations come whether I 
want them or not. People are interested in how life began and they instinctively 
understand that whatever theory we adopt has larger philosophical, religious, or 
worldview implications. People are usually energized by considering those larger 
implications and questions. Many would like to find a way to harmonize the 
evidence from science with a view of the world that addresses their deepest 
existential longings as human beings, their yearning for purpose and 
significance. But like our guide, many have been frustrated by the difficulty of 
arriving at a coherent synthesis. 

It’s not hard to see why. On the one hand, many people of faith have little real 
interest in what science has to say about life’s origins. Indeed, many well- 
meaning religious believers have adopted a view of the relationship between 
science and faith that rejects the testimony of science as irrelevant or even 
dangerous and affirms that just reading the Bible will give all the insight needed 
to understand how life came to be. Their approach does not really attempt to 
harmonize faith and science, since it takes faith in the Bible, and often a 
particular interpretation of the Bible, as the only reliable source of information 
about life’s origin. 

On the other hand, many scientists and others who think that science has 
something to teach us about the big questions have started by assuming the neo- 
Darwinian account of biological origins, despite its many scientific difficulties— 
and despite its denial of any role for purposive intelligence in the history of life. 

In particular, two popular ideas about how Darwinism informs worldview 
have come to different conclusions about the worldview it affirms—or allows. 
The first view, the “New Atheism,” has been articulated by spokesmen such as 


Richard Dawkins in his book The God Delusion and the late Christopher 
Hitchens in God Is Not Great.' It purports to refute the existence of God as “a 
failed hypothesis,”* as another New Atheist book puts it. Why? Because, 
according to Dawkins and others, there is no evidence of design in nature. 
Indeed, Dawkins’s argument for atheism hinges upon his claim that natural 
selection and random mutation can explain away all “appearances” of design in 
nature. And since, he asserts, the design argument always provided the strongest 
argument for believing in God’s existence, belief in God, he concludes, is 
extremely improbable—tantamount to “a delusion.” For the New Atheists, 
Darwinism makes theistic belief both implausible and unnecessary. As Dawkins 
has famously put it, “Darwin made it possible to be an intellectually fulfilled 
atheist.”° 

The New Atheists took the publishing world by storm in 2006 when The God 
Delusion first appeared. But nothing about the “New” Atheism was actually 
“new.” Instead, it represents a popularization of a science-based philosophy, 
called scientific materialism, that came into currency among scientists and 
philosophers during the late nineteenth century in the wake of the Darwinian 
revolution. For many scientists and scholars at the time, a scientifically informed 
worldview was a materialistic worldview in which entities such as God, free 
will, mind, soul, and purpose played no role. Scientific materialism, following 
classical Darwinism, denied evidence of any design in nature and, therefore, any 
ultimate purpose to human existence. As British philosopher and mathematician 
Bertrand Russell put it early in the twentieth century, “Man is the product of 
causes which had no prevision of the end they were achieving” and which 
predestine him “to extinction in the vast death of the solar system.”4 

An alternative and increasingly popular view is known as theistic evolution. 
Popularized by Christian geneticist Francis Collins in his book The Language of 
God (also published in 2006),° this perspective affirms the existence of God and 
the Darwinian account of biological origins. Yet it provides few details about 
how God might or might not influence the evolutionary process, or how to 
reconcile seemingly contradictory claims in the Darwinian and Judeo-Christian 
accounts of origins. For example, Collins has declined to say whether he thinks 
God in any way directed or guided the evolutionary process, though he affirms 
neo-Darwinism, which specifically denies that natural selection is guided in any 
way. Darwinism and neo-Darwinism insist that the appearance of design in 
living organisms is an illusion because the mechanism that produces that 
appearance is unguided and undirected. Does God, in Collins’s view, guide the 


unguided process of natural selection? He, and many other theistic evolutionists, 
don’t say. This ambiguity has made an uneasy reconciliation of science and faith 
possible, but it has also left many questions unanswered. In fairness, many 
theistic evolutionists would argue that not all such questions can be answered, 
because science and faith occupy separate, non-overlapping realms of inquiry, 
knowledge, and experience. But that answer itself underscores the limits of the 
harmonization of science and faith that Collins and others holding his view has 
achieved. 

The argument of this book presents a scientific challenge to both of these 
views. In the first place, the evidence and arguments we have seen show that the 
scientific premise of the New Atheist argument is flawed. The mechanism of 
mutation and natural selection does not have the creative power attributed to it 
and, thus, cannot explain all “appearances” of design in life. The neo-Darwinian 
mechanism does not explain, for example, either the new genetic or epigenetic 
information necessary to produce fundamentally new animal body plans. 

This book has presented four separate scientific critiques demonstrating the 
inadequacy of the neo-Darwinian mechanism, the mechanism that Dawkins 
assumes can produce the appearance of design without intelligent guidance. It 
has shown that the neo-Darwinian mechanism fails to account for the origin of 
genetic information because: (1) it has no means of efficiently searching 
combinatorial sequence space for functional genes and proteins and, 
consequently, (2) it requires unrealistically long waiting times to generate even a 
single new gene or protein. It has also shown that the mechanism cannot produce 
new body plans because: (3) early acting mutations, the only kind capable of 
generating large-scale changes, are also invariably deleterious, and (4) genetic 
mutations cannot, in any case, generate the epigenetic information necessary to 
build a body plan. Thus, despite the commercial success of The God Delusion 
and its wide cultural currency, the New Atheist philosophy lacks credibility 
because it has based its understanding of the metaphysical implications of 
modern science on a scientific theory that itself lacks credibility—as even many 
leading evolutionary biologists now acknowledge.° 

Second, this book poses a strong challenge to theistic evolutionists such as 
Francis Collins for many of the same scientific reasons. Collins places great trust 
in modern Darwinism as the unifying theory of biology, but seems completely 
unaware of the formidable scientific problems now afflicting the theory—in 
particular, the challenges to the creative power of the natural selection/mutation 
mechanism. He makes no attempt to address or answer any of these challenges. 


In addition, many of his arguments for universal common descent—the defense 
of which was his main concern in The Language of God—are based upon the 
alleged presence of nonfunctional or “junk” elements in the genomes of different 
organisms. Though the theory of intelligent design, which Collins says he 
opposes, does not necessarily challenge this part (common descent) of 
Darwinian theory, the factual basis of his arguments has now also largely 
evaporated as the result of ENCODE and other developments in genomics.’ 
Thus, this popular view of biological origins, and its conception of God’s 
relationship to the natural world, now stands starkly at odds with the evidence. 
But why attempt to reconcile traditional Christian theology with Darwinian 
theory, as Collins tries to do, if the theory itself has begun to collapse? 

The perspective of this book offers a potentially more coherent and satisfying 
way of addressing the big questions, of synthesizing science and metaphysics (or 
faith), than either of the currently popular views on offer. The Cambrian 
explosion, like evolutionary theory itself, raises larger worldview questions 
precisely because it raises questions of origins and of design, and with them, the 
question that all worldviews must address: What is the thing or the entity from 
which everything comes? But unlike strict Darwinian materialism and the New 
Atheism built atop it, the theory of intelligent design affirms the reality of a 
designer—a mind or personal intelligence behind life. This case for design 
restores to Western thought the possibility that human life in particular may have 
a purpose or significance beyond temporary material utility. It suggests the 
possibility that life may have been designed by an intelligent person, indeed, one 
that many would identify as God. 

Unlike the theistic evolution of Francis Collins, however, the theory of 
intelligent design does not seek to confine the activity of such an agency to the 
beginning of the universe, conveying the impression of a decidedly remote and 
impersonal deistic entity. Nor does the theory of intelligent design merely assert 
the existence of a creative intelligence behind life. It identifies and detects 
activity of the designer of life, and does so at different points in the history of 
life, including the explosive show of creativity on display in the Cambrian event. 
The ability to detect design makes belief in an intelligent designer (or a creator, 
or God) not only a tenet of faith, but something to which the evidence of nature 
now bears witness. In short, it brings science and faith into real harmony. 

Just as importantly, perhaps, the case for design supports us in our existential 
confrontation with the void and the seeming meaninglessness of physical 
existence—the sense of survival for survival’s sake that follows inexorably from 


the materialist worldview. Richard Dawkins and other New Atheists may find it 
untroubling, even amusing and certainly profitable, to muse over the prospect of 
a universe without purpose. But for the vast majority of thoughtful people, that 
idea is tinged with terror. Modern life suspends many of us, so we feel, high over 
a chasm of despair. It provokes feelings of dizzying anxiety—in a word, vertigo. 
The evidence of a purposeful design behind life, on the other hand, offers the 
prospect of significance, wholeness, and hope. 

As my son walked out across the mountain high above the Yoho Valley, he 
was surrounded by many slabs of rock containing some of the very fossils we 
had come to see. But as he surveyed that barren portion of landscape, he lost 
perspective on where he was and what he had come to do. Without landmarks or 
steadying points of reference, he felt as if he were lost in a sea of sensory 
impressions. Without his sense of balance, he feared even to take a step. He 
called out for his father. 

It occurred to me only much later how closely his experience parallels our 
own as human beings trying to make sense of the world around us. To gain a true 
picture of the world and our place in it we need facts—empirical data. But we 
also need perspective, sometimes called wisdom, the reference points that a 
coherent view of the world provides. Historically, that wisdom was provided for 
many men and women by the traditions of Western monotheism—by our belief 
in God. The theory of intelligent design generates both excitement and loathing 
because, in addition to providing a compelling explanation of the scientific facts, 
it holds out the promise of help in integrating two things of supreme importance 
—-science and faith—that have long been seen as at odds.® 

The theory of intelligent design is not based upon religious belief, nor does it 
provide a proof for the existence of God. But it does have faith-affirming 
implications precisely because it suggests the design we observe in the natural 
world is real, just as a traditional theistic view of the world would lead us to 
expect. Of course, that by itself is not a reason to accept the theory. But having 
accepted it for other reasons, it may be a reason to find it important. 
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Plate 1. Mt. Maotian, near Chengjiang, China. The ‘famous Chengjiang fossil 
site was discovered near the top of this mountain. Courtesy Illustra Media (all). 
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Plate 2 _ The original discovery site of the first Chengjiang fossils. Today this 
area is a preserve. 





Plate 3. Looking down from Mt. Maotian, with the large Fuxian Lake in the 
distant foreground. 





Plate 4. The opabinid Jianfengia multisegmentalis. Courtesy Paul Chien. Unless 
otherwise noted, all fossils shown on this and subsequent color pages are from 
the Maotianshan Shale near Chengjiang, China. 





Plate 5. A fossilized comb jelly (a ctenophore). Species: Maotianoascus 
octonarius. Courtesy J. Y. Chen. 
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Plates 6 & 7. Priapulid worms of the species Maotianshania cylindrica. 
Courtesy Paul Chien and Illustra Media. 





Plate 8. A lobopodian “worm with legs” of the genus Microdictyon. 







Plate 9. Another lobopod worm. Species: Microdictyon sinicum. 
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Plate 10. Luolishania longicruris, a rare lobopod of the M 


aotianshan Shale. 


Plate 11. The extinct and enigmatic ‘Vetulicolia, thought to represent either an 
arthropod, a tunicate-like urochordate, or a distinct phylum. Courtesy Illustra 
Media (all). 





Plates 12 & 13. Two specimens of the arthropod Leanchoilia. The top specimen 
shows its distinctive long frontal whiplike appendages. 





Plate 14. Chengjiangocaris longiformis (below), a rare arthropod known only 
from the Chengjiang biota. Courtesy Illustra Media (all). 

Two fossils from the dark-colored Burgess Shale, near the town of Field in 
British Columbia, Canada. Note the contrast in color to the fossils from the 
honey-colored Maotianshan Shale depicted on the previous and subsequent 


pages. 
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Plate 15. A fossil of the arthropod Marrella from the Burgess Shale. Courtesy 
Illustra Media. 





Plate 16 (below). The shrimp-like arthropod Waptia. Courtesy Illustra Media. 





Plate 17. The arthropod Fuxianhuia protensa. Note its exquisitely preserved 
eyes, antennae, segmented body, and forked tail. Courtesy Illustra Media. 





Plate 18. A fossil of the arthropod Waptia, from the honey-colored Maotianshan 
Shale. Courtesy Paul Chien. 


St ES rap, - 
Plate 19. A fossil of an Anomalocaris, whose large eyes and jaws made it the 
fiercest predator in the Cambrian seas. Some reached a length of one meter. 
Courtesy J. Y. Chen. 





Plates 20-22. A montage of beautifully preserved trilobites from the 
Maotianshan Shale, including a specimen of Eoredlichia intermedia (left). 
Courtesy Illustra Media (all). 





Plate 23. A specimen of the trilobite Kuanyangia pustulosa from the 
Maotianshan Shale near Chengjiang, China. Courtesy Illustra Media. 
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